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FOREWORD 


The  underlying  purpose  of  this  report  is  to 
present  an  objective  evaluation  of  several  techniques  for 
adaptively  controlling  and  guiding  tactical  missiles. 
Because  design  trade -offs  always  exist  between  perform¬ 
ance  and  control  system  complexity,  there  is  probably 
no  one  control  method  that  is  preferable  for  all  applica¬ 
tions.  Consequently,  in  this  work  no  single  method  is 
advocated  as  the  panacea  for  all  missile  design  problems. 
Insteadthe  discussion  emphasizes  distinguishing  charac¬ 
teristics  of  each  technique  so  the  reader  can  judge  which 
is  most  suitable  for  his  own  situation. 

A  by-product  of  this  research  effort  is  an 
organized,  unified  discussion  of  many  technical  aspects 
of  adaptive  control  which  have  heretofore  been  available 
only  in  isolated  papers.  New  research  results  produced 
by  this  investigation  are  also  included.  Therefore, 
although  this  study  has  been  performed  primarily  for 
tactical  missile  applications,  the  material  collected  here 
should  also  be  of  interest  to  those  working  in  other  areas 
where  adaptive  control  methods  are  needed. 

The  authors  are  grateful  for  the  encourage¬ 
ment  and  support  provided  by  Mr.  David  Siegel  of  the 
Office  of  Naval  Research  and  Mr.  Paul  Blatt  of  the  Air 
Force  Flight  Dynamics  Laboratory.  Acknowledgement 
is  also  made  to  Professor  Richard  V.  Monopoli  of  the 
University  of  Massachusetts  for  his  contributions  rela¬ 
tive  to  Liapunov  design  techniques.  Helpful  assistance 
was  provided  in  several  technical  areas  by  Professor 
John  J.  Deyst,  Jr.  of  the  Massachusetts  Institute  of 
Technology  and  by  Dr.  Joseph  J.  Budelis.  Appreciation 
is  also  expressed  to  Professor  Wallace  E.  VanderVelde 
of  the  Massachusetts  Institute  of  Technology  for  his  help¬ 
ful  review  of  portions  of  the  document. 
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ABSTRACT 


The  fields  of  adaptive  control  and  guidance  are 
searched  for  techniques  that  can  be  beneficially  applied 
to  the  design  of  guidance  systems  for  tactical  missiles,, 
A  large  number  of  existing  adaptive  control  techniques 
are  investigated  and  new  methods  which  are  suited  to  the 
needs  of  missile  control  systems,  are  proposed.  The 
feasibility  of  promising  autopilot  design  procedures  is 
demonstrated  through  computer  simulations,  using  real¬ 
istic  time-varying  airframe  dynamics.  Guidance  tech¬ 
niques  for  tactical  missiles  are  also  reviewed  and  a 
number  of  steering  laws,  derived  from  optimal  control 
theory,  are  evaluated.  Quantitative  comparisons  are 
made  between  different  guidance  laws  on  the  basis  of 
intercept  accuracy  and  control  effort  expended. 

The  report  is  published  in  two  volumes  con¬ 
taining  four  basic  parts  —  Introduction  (which  includes 
the  summary  and  conclusions  for  the  entire  report), 
Adaptive  Control  Theory,  Adaptive  Control  Applications, 
and  Guidance.  The  first  two  parts  constitute  Volume  I 
and  the  remainder  together  with  several  appendices 
compose  Volume  II. 
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8.  APPLICATIONS:  PARAMETER  ADAPTIVE  CONTROL 

SYSTEMS  WITH  IMPLICIT  PLANT  IDENTIFICATION 


In  this  chapter  two  adaptive  control  techniques  described  in 
Chapter  4  —  the  accelerated  gradient  and  Liapunov  design  methods  —  are 
considered  for  tactical  missile  autopilot  design.  Our  primary  goal  is  to 
demonstrate  the  feasibility  of  these  methods  in  terms  of  resulting  missile 
performance.  The  accelerated  gradient  method  is  applied  to  both  adaptive 
pitch  rate  and  normal  acceleration  autopilots  and  the  Liapunov  method  is 
used  for  pitch  rate  control. 


8 . 1  DE  SIGN  CONSIDE  RATIONS 

hi  designing  any  control  system,  adaptive  or  not,  one  begins 
with  a  mathematical  model  for  the  plant  and  specified  performance  cri¬ 
teria.  We  first  discuss  these  aspects  of  the  problem  formulation  for  tacti 
cal  missiles. 


8.1.1  Airframe  Dynamics 

The  rotational  motion  of  an  airframe  is  generally  described  in 
terms  of  6  state  variables  —  three  angles  and  three  angular  rates.  The 
exact  differential  equations  of  motion  for  these  variables  are  nonlinear  and 
are  also  coupled  to  the  translational  motion  of  the  airframe  through  such 
quantities  as  air  speed  and  altitude  which  describe  the  missile’s  flight  con¬ 
dition.  As  is  mentioned  in  Section  2.1,  the  coupling  between  translation 
and  rotation  is  simplified  if  the  former,  being  affected  primarily  by  the 
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relatively  long  response  time  of  the  guidance  loop,  is  regarded  as  being 
independent  of  the  autopilot  characteristics.  Consequently,  with  respect 
to  the  rotational  equations  of  motion,  the  variables  which  define  the  flight 
condition  can  be  regarded  as  time-varying  quantities  that  are  independent 
of  the  missile  airframe  response  characteristics.  The  dimensionality  of 
the  rotational  equations  is  usually  reduced  by  considering  them  to  be 
separable  into  three  uncoupled  motions  —  pitch,  yaw,  and  roll  --  each  of 
which  is  controlled  separately.  In  a  cruciform  missile  pitch  and  yaw 
dynamics  are  identical,  *  both  being  used  to  respond  to  steering  commands, 
and  the  function  of  the  roll  control  system  is  primarily  one  of  stabilization. 
After  these  simplifications  there  still  remain  nonlinearities  in  the  depend¬ 
ence  of  the  equations  of  motion  upon  angle  of  attack  and  control  surface 
deflection  angle;  linearization  is  achieved  by  employing  small  angle 
approximations. 

In  this  report  we  are  interested  not  only  in  autopilot  design,  but 
also  in  the  relationship  of  the  autopilot  to  the  overall  performance  of  the 
guidance  system.  The  latter  is  investigated  by  assuming  the  missile’s 
translational  motion  is  confined  to  a  single  plane,  taken  to  be  the  pitch 
plane.  Consequently  the  autopilot  governing  pitch  motion  which  receives 
the  steering  commands  is  most  relevant  for  this  study. 

The  mathematical  model  used  here  to  describe  pitch  motion  is 
for  a  missile  which  develops  control  forces  through  aerodynamic  lift  pro¬ 
vided  by  fixed  wings  or  by  the  missile  body,  with  the  aid  of  tail-mounted 
control  surfaces.  This  is  currently  the  most  common  configuration. 


$ 

In  the  discussion  that  follows  the  effects  of  gravity  are  neglected. 
However,  the  text  speaks  of  ’’pitch”  dynamics  in  the  "pitch”  plane, 
recognizing  that  in  a  strict  sense  it  is  the  horizontal  plane  and 
yaw  dynamics  that  are  described. 
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The  model  assumes  second  order  airframe  and  first  order  actuator 
dynamics  with  equations  of  motion  given  by 

q(t)  =  Mq  q(t)  +  a(t)  +  Mg  6(t) 

d(t)  =  q(t)  -  L  a(t)  -  L  6(t) 
a.  o 

6(t)  =  -  X  6(t)  +  X  u(t) 

a(t)  =  -  V[d(t)  -  q(t)]  (8.1-1) 


where  M  ,  M  ,  L  ,  and  L.  are  stability  derivatives  and 
q’  a.  b’  a  6 


q(t)  =  pitch  rate 
a(t)  =  normal  acceleration 
6(t)  =  control  surface  deflection 
V  =  air  speed 


a(t)  =  angle  of  attack 
u(t)  =  control  command 
-  X  -  actuator  pole 


Refer  to  Fig.  8. 1-1  for  a  geometrical  definition  of  autopilot  state  variables. 
The  stability  derivatives  can  be  expressed  analytically  in  terms  of  aero¬ 
dynamic  coefficients,  airframe  parameters,  airspeed,  and  dynamic  pres¬ 
sure  by  the  relations 
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Figure  0.1-1 


Geometrical  Definitions  of  Pitch 
Plane  State  Variables 


where 


q 

I 

yy 

s 

d 

CM  ,CM  ,CM 
q  a  6 


dynamic  pressure 

moment  of  inertia  about  pitch  axis 

lifting  surface  area 
characteristic  length 
moment  coefficients 

normal  force  coefficients 


m  =  mass 
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form 


The  equations  of  motion  can  be  expressed  in  the  vector-matrix 
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(8.1-3) 

or,  in  abbreviated  notation, 
x(t)  =  Ax(t)  +  bu(t) 
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(8.1-4) 


To  design  an  adaptive  autopilot  using  the  techniques  suggested  in 
Chapter  4  we  augment  Eq.  (8. 1-4)  with  the  equations  of  motion  for  a  known 
reference  model  that  has  desired  output  response  characteristics.  The 
total  system  is  illustrated  in  Fig.  8.1-2  and  is  described  by  the  relations 


x  =  Ax(t)  +  bu(t) 

y(t)  =  cTx(t; 

m(t)  =  Hx(t) 

u(t)  =  v(t)  -  r(t) 

x  (t)  =  A  x  (t)+b  v(t) 

— m  m— m  —  m 

ym(t)  *  ^T£m(t) 

e(t)  =  y(t)  -  ym(t) 

(8.1-5) 
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f-itri 


Figure  8. 1-2  Input-Output  Relations  for  Autopilot 
Design  Problem 


where  xm(t)  is  the  state  of  the  reference  model  having  time-invariant 
dynamics  described  by  Am  and  b>m,  y(t)  is  the  output  of  the  autopilot  which 
is  to  be  compared  with  the  model  output  ym(t),  e(t)  is  the  output  error  sig¬ 
nal,  m(t)  is  the  set  of  measurements  available  for  use  by  the  controller, 
v(t)  is  the  steering  command  input,  and  r(t)  is  that  portion  of  the  control 
command  which  can  be  specified  by  the  designer.  The  overall  objective 
is  to  choose  r(t)  so  that  the  output  error  signal  remains  small. 

Another  expression  that  is  useful  for  describing  the  airframe 
dynamics  is  the  input-output  transfer  function  relating  y(t)  and  u(t),  given  by 
by 

=  £T  (Is  -  A)-1  b  (8.1-6) 

* 

In  Section  8.2.4  it  is  demonstrated  that  an  adaptive  (time -varying) 
reference  model  has  advantages  for  missile  control. 
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where  Y(s)  and  U(s)  are  the  Laplace  transforms  of  y(t)  and  u(t)  and  it  is 
assumed  that  A  and  b  are  time  invariant.  For  the  applications  being  con¬ 
sidered,  specific  expressions  for  c  are 


c  =  [  1  0  0]  =»  y(t)  =  q(t) 


c  =  [  0  1  0]  =&  y(t)  =  a(t) 


Substitution  of  Eqs.  (8.1-4)  and  (8. 1-7)  into  Eq.  (8. 1-6)  produces 


(8.1-7) 
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^  (s  +  X)  (s  -  (an  +  a22)  s  +  an  a22  “  al2  a2l) 


(8.1-8) 


where  Q(s)  and  A(s)  denote  the  transforms  of  q(t)  and  a(t). 


The  above  mathematical  description  of  the  missile  airframe  omits 
contributions  from  structural  bending  modes  and  sensor  dynamics.  This 
keeps  the  control  system  sufficiently  simple  to  permit  the  scope  of  this  study 
to  include  several  adaptive  techniques,  but  retains  enough  detail  to  permit 
realistic  conclusions  about  control  of  the  dominant  response  characteristics. 
In  a  complete  missile  control  system  design,  any  high  order  effects 
neglected  in  synthesizing  the  autopilot  should  be  included  in  simulation 
evaluations  of  performance. 

In  order  to  evaluate  various  control  schemes,  a  set  of  typical 
missile  dynamics  is  required  for  performing  computer  simulations ;  it 
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should  include  the  time -varying  character  of  airframe  parameters  along 
the  flight  path.  Two  sets  of  such  data  are  given  in  Appendix  H,  repre¬ 
senting  air -to -surface  and  surface-to-air  types  of  trajectories. 


8. 1. 2  Performance  Criteria 


In  designing  an  adaptive  system  care  must  be  exercised  in 
choosing  a  set  of  reference  model  dynamics  and  an  output  signal  y(t).  One's 
first  thought  may  be  to  designate  any  conveniently  measured  state  variable 
as  the  output  variable  and  select  a  reference  model  having  natural  frequen¬ 
cies  such  that  the  model  output  ym(t)  has  a  desirable  transient  response. 
Then  the  controller  would  be  designed  to  make  the  error  between  the  output 
variables  small,  according  to  some  adaptive  procedure.  However,  if  this 
reasoning  is  applied  too  loosely  the  result  can  be  an  adaptive  control  sys¬ 
tem  with  exactly  the  wrong  performance  characteristics.  This  point  will 
be  elaborated  as  we  discuss  a  proper  design  procedure  for  the  missile 
application. 

From  a  functional  point  of  view,  the  missile  autopilot  exists  only 
to  serve  the  guidance  objectives.  Consequently  performance  criteria  for 
the  control  system  should  be  based  upon  what  is  required  by  the  steering 
commands.  In  Chapter  11  it  is  demonstrated  that  most  effective  guidance 
laws  use  a  normal  acceleration  steering  command  so  that  the  autopilot  out¬ 
put  variable  of  interest  is  normal  acceleration,  a(t)  in  Eq.  (8.1-3).  Thus 
the  reference  model  specifications  should  be  such  that  the  model's  normal 
acceleration  am(t)  has  desired  response  characteristics  to  v(t)  and  the 
adaptive  controller  should  attempt  to  null  the  error 

e(t)  =  a(t)  -  am(t) 


8-8 


THE  ANALYTIC  SCIENCES  CORPORATION 


That  is  to  say,  an  adaptive  normal  acceleration  autopilot  is  desired. 

For  reasons  related  to  the  adaptation  properties  of  the  system, 
which  are  discussed  more  fully  in  the  sequel,  it  is  sometimes  tempting  to 
try  achieving  acceleration  control  indirectly  by  using  a  state  variable  other 
than  normal  acceleration  to  define  the  output  of  the  autopilot.  Specifically, 
we  know  from  Eq.  (8. 1-1)  that  constant  values  of  pitch  rate,  and  accelera¬ 
tion  are  related  simply  by 


a  =  qV  (8.1-9) 

This  represents  the  steady  state  airframe  response  to  a  step  input.  If 
an  estimate  of  air  speed  is  available,  a  constant  pitch  rate  command 
can  be  applied  which  yields  the  desired  constant  acceleration  after  the 
transient  has  subsided.  Consequently  one  might  consider  implementing 
an  adaptive  pitch  rate  autopilot,  even  though  acceleration  is  the  quantity  of 
importance,  in  the  expectation  that  the  acceleration  and  pitch  rate  transient 
responses  would  have  similar  response  times.  Unfortunately,  this 
approach  can  be  incorrect,  depending  upon  the  type  of  missile  being 
employed. 


To  understand  the  potential  fallacy  of  the  technique  described 
above,  one  must  consider  the  dynamic  relationship  between  pitch  rate  and 
normal  acceleration  for  the  particular  missile  under  consideration.  If  the 
airframe  depends  upon  aerodynamic  forces  for  developing  the  lift  required 
to  produce  normal  acceleration,  the  dependence  between  a(t)  and  q(t) 
changes  with  flight  condition  (FC).  To  illustrate  this  fact  consider  an  air¬ 
frame  modelled  by  Eq.  (8.1-3)  and  suppose  u(t)  is  given  by 

u(t)  =  v(t)  -  kjq(t)  -  kga(t)  -  k3§(t) 
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with  the  set  of  constant  gains  kj,  kg,  and  k^  chosen  at  different  flight 
conditions  so  that  the  resulting  system  poles  have  specified  values.  This 
can  always  be  done  because  the  system  is  controllable  (Ref.  30).  In  par¬ 
ticular,  choose  two  sets  of  gains  so  that  the  system  poles  are  at  -40, 

-5  +j5,  -5  -  j5  in  both  flight  conditions  4(high  altitude)  and  6(low  altitude) 
of  Tables  H.  i  and  H. 2  and  apply  a  step  pitch  rate  command  v(t)  which 
yields  a  normal  acceleration  of  28.0  feet/sec^  in  the  steady  state.  The 
resulting  step  responses  are  shown  in  Fig.  8. 1-3.  Notice  that  the  settling 
times  of  the  system  are  about  the  same  for  both  flight  conditions  tx  cause 
the  system  closed  loop  poles  are  the  same  for  both  cases.  However,  the 
overshoot  in  pitch  rate  is  considerably  different  in  the  two  cases,  that  for 
condition  4  being  considerably  larger.  The  physical  reason  for  this  be¬ 
havior  is  that  the  missile,  being  aerodynamically  controlled,  must  develop 
a  larger  angle  of  attack  at  FC  4  than  at  FC  6  to  achieve  the  same  normal 
acceleration,  primarily  because  the  dynamic  pressure  is  lower.  More 
specifically,  in  the  case  illustrated  here,  the  velocities  are  nearly  the 
same  at  both  flight  conditions  but  the  altitudes  are  considerably  different. 
At  the  higher  altitude  (FC  4)  the  air  density  is  lower  so  that  the  lifting  sur¬ 
faces  are  less  effective  than  at  low  altitudes;  thus  a  larger  angle  of  attack 
is  required  to  achieve  the  same  lifting  force.  This  reasoning  implies  that 
the  pitch  rate  must  have  a  greater  peak  value  if  the  settling  time  is  to  be 
the  same  at  both  flight  conditions. 

The  above  example  illustrates  that  the  physics  of  aerodynamic 
missile  control  demands  a  wide  variation  in  pitch  rate  transient  response 
as  plant  dyna*.  *cs  change,  if  the  settling  time  of  the  normal  acceleration 
response  is  to  remain  constant.  Consequently  if  one  should  try  to  obtain 
uniform  pitch  rate  response,  just  the  opposite  result  will  be  achieved; 
i.e. ,  the  acceleration  transient  response  will  vary  widely  over  different 
flight  conditions.  This  behavior  is  rather  simply  described  analytically  with 
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Figure  8. 1-3  Pitch  Rate  and  Normal  Acceleration  Step 
Responses  for  Two  Flight  Conditions 


the  aid  of  the  diagram  in  Fig.  8.1-4.  Suppose  an  adaptive  autopilot  can  be 
designed  so  that  at  all  flight  conditions  the  pitch  rate  response  to  a  command 
input  v(t)  is  described  by  a  fixed  reference  model  transfer  function,  Tm(s). 
However,  the  dynamic  relationship  between  pitch  rate  and  normal  accelera¬ 
tion  is  deduced  from  Eq.  (8. 1-8), 


(8.1-10) 
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Figure  8.1-4  Illustration  of  the  Effect  of  a  Pitch 
Rate  Adaptive  Autopilot  on  Normal 
Acceleration  Response 


Evidently  the  latter  transfer  function  is  unaffected  by  the  adaptive  con¬ 
troller  and  is  sensitive  to  variations  in  the  elements  of  A  and  b  in  Eq. 
(8.1-4).  Consequently  the  acceleration  response  to  the  input,  given  by 


A(s) 

W) 


=  T  (s) 
mv 


A(s) 


(8.1-11) 


is  also  subject  to  changes  in  its  parameters.  The  most  important  effect 
is  the  single  pole  in  Eq.  (8.1-10)  which  tends  to  be  smaller  in  magnitude 
at  high  altitudes  and/or  low  air  speed  (FC  4  in  Fig.  8. 1-3)  than  it  is  at 
low  altitudes  and/or  high  air  speed  (FC  6  in  Fig.  8. 1-3).  The  resulting 
effect  on  normal  acceleration  response  is  a  variation  in  settling  time  as 
flight  conditions  change;  this  is  demonstrated  subsequently  in  simulation 
results. 
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The  conclusion  is  that  an  adaptive  normal  acceleration  autopilot 
is  required  for  aerodynamically  controlled  missiles.  When  other  mech¬ 
anisms  which  are  independent  of  aerodynamic  forces,  such  as  thrust  vec¬ 
toring,  are  used  for  developing  control  force,  it  is  possible  that  a  pitch 
rate  autopilot  will  yield  satisfactory  performance.  In  any  case,  the  designer 
designer  must  keep  in  mind  that  normal  acceleration  is  actually  the  output 
variable  of  interest  and  ensure  that  its  response  characteristics  are  cor¬ 
rect  for  whatever  control  scheme  is  devised. 

Many  adaptive  techniques  developed  or  proposed  in  the  past  have 
been  concerned  with  controlling  aircraft  airframe  pitch  rate  or  roll  motion. 
Little  attention  has  been  given  to  the  problem  of  designing  an  adaptive 
normal  acceleration  autopilot  using  the  methods  described  in  Chapter  4„ 

Both  pitch  rate  and  acceleration  control  of  missiles  are  investigated  in 
subsequent  sections. 


8.2  APPLICATION  OF  THE  ACCELERATED  GRADIENT  METHOD 

Having  outlined  requirements  for  autopilots  in  Section  8.1,  we 
now  consider  application  of  a  particular  adaptive  technique,  the  accelerated 
gradient  method  discussed  in  Section  4.3.  hi  this  section  the  design  pro¬ 
cedure  is  explained  in  detail  and  simulation  results  are  presented  for  both 
pitch  rate  and  normal  acceleration  autopilots. 


8.2.1  Design  Procedure:  General  Considerations 

The  design  procedure  for  an  accelerated  gradient  controller  con¬ 
sists  of  three  distinct  steps: 
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•  Define  the  controller  structure. 

•  Derive  the  simplified  gradient  adaptation  algorithm. 

•  Introduce  linear  compensation  in  the  adaptive  loop. 

The  Controller  Structure  —  The  equations  of  motion  are  given 
by  Eq.  (8. 1-5)  and  measurements  (or  estimates)  of  the  state  variables  are 
assumed  to  be  available,  i.e., 


m(t)  =  x(t) 

The  free  control  variable  r(t)  is  defined  to  be 

r(t)  =  (1  -  k(t))  v(t)  +  h(t)T  x(t)  (8.2-1) 

so  that  the  input  to  the  plant,  u(t),  becomes 

u(t)  =  k(t)  v(t)  -  h(t)T  x(t)  (8.2-2) 

where  k(t)  and  h(t)  are  adaptive  gains.  Thus  we  assume  all  three  state  vari¬ 
ables  —  pitch  rate,  normal  acceleration,  and  control  surface  deflection  — 
are  available  for  feedback  control.  This  particular  controller  structure 
is  chosen  so  that  any  values  of  the  closed  loop  poles,  which  are  the  eigen- 
values  of  the  matrix  (A-bh  ),  can  be  selected  by  proper  choice  of  h, 
given  A  and  b.  The  gain  k(t)  is  introduced  to  provide  independent  control 
over  the  d-c  gain  between  input  and  output. 

The  error  signal  e(t)  is  given  by 

e(t)  =  cT  (x(t)  -  £m(t))  (8.2-3) 
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where 


m  [10  0  ];  pitch  rate  autopilot 

£  =  (8.2-4) 

[0  1  0  ];  normal  acceleration  autopilot 

The  detailed  simulation  of  each  type  of  autopilot  is  considered  separately 
in  Sections  8.2.2  and  8.2.3. 

The  Simplified  Gradient  Adaptation  Algorithm  —  The  adaptive 
controller  for  the  autopilot  is  designed  by  applying  techniques  suggested 
in  Sections  4.2.4,  4.2.6,  and  4.3.  The  basic  adaptation  algorithm  is 
chosen  using  the  gradient  method  described  in  Section  4.2.4;  then  linear 
compensation  is  added  as  needed  in  the  adaptive  loop  to  improve  the  adap¬ 
tation  speed.  The  relations  for  the  simplified  gradient  adaptation  algo¬ 
rithm  are  given  by  Eqs.  (4.2-30)  and  (4. 2-37)  for  the  system  with  adaptive 
feedback  gains  h(t).  These  are  repeated  here  for  convenience  in  the  form 

lij(t)  =  p;  cT  (x(t)  -  xm(t)j  x.(t) 

pt/  =  ^  sign[^cT  AN_1  b  J  ;  1  =  1,. ,,n  (8.2-5) 

where  0/  is  an  adaptation  gain  whose  magnitude  0.  is  chosen  by  the  designer 
and  whose  algebraic  sign  is  specified  by  the  quantity  in  brackets.  The 
quantity  N  is  the  smallest  positive  integer  such  that  (cT  AN-1b)  is  nonzero. 

The  autopilot  equations  of  motion  are  the  same  form  as  Eq. 
(4.2-37)  except  for  the  additional  adaptive  gain  k(t)  in  Eq.  (8.2-2)  for  which 
an  adjustment  algorithm  must  be  provided.  This  can  be  obtained  from  the 
first  expression  in  Eq.  (4.2-27),  which  holds  for  any  adaptive  quantity 
appearing  in  the  differential  equation  for  x(t).  Thus 
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T  ax(t  f 
k(t)  =  -  ^  e(t)  c  — ^ 


(8.2-6) 


where  is  a  positive  gain  to  be  chosen  by  the  designer.  Evaluating  the 
partial  derivative  in  Eq.  (8.2-6),  one  obtains 


a  x(t)' 


=  A  bv(t) 


(8.2-7) 


where  N  is  the  smallest  integer  greater  than  or  equal  to  one  such  that 


eT  AN_1  b  +  0 


By  analogy  with  the  solutions  obtained  for  the  gains  let 


^  sign  [cT  AN_1  b  j 


(8.2-8) 


Now,  Eqs.  (8.1-5),  (8. 2- 1)  through  (8.2-3),  and  (8.2-5)  through  (8.2-8) 
are  combined  to  yield  the  equations  of  motion  of  the  gradient  -adaptive 
control  system; 


x(t)  =  (A  -  bh(t)T)  x(t)  +  b  k(t)  v(t) 


x  (t)  =  A  x  (t)  +  b  v(t) 
— m  m  — m  — m 


=  e(t)  x.(t);  i  =  1, . . .  ,n 


-  0^  e(t)  v(t) 
cT  ^x(t)  -  xm(t)  j 


(8.2-9) 
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where  c  is  given  by  one  of  the  expressions  in  Eq.  (8.2-4),  according  to 
which  type  of  autopilot  is  under  consideration. 

Adaptive  Loop  Compensation  —  Following  the  procedure  sug¬ 
gested  in  Eq.  (4.2-39)  of  Section  4.2. 6,  we  expand  h(t),  x(t),  and  k(t)  about 
nominal  values 


k(t)  =  kQ+6k(t)  h(t)  =  h^+Shft)  x(t)  =  xm(t)  +  6x(t) 


(8.2-10) 


and  linearize  Eq.  (8.2-9).  With  the  nonlinear  and  forcing  terms  neglected, 
the  result  is 


A-bhr 
— o 


B'x  (t)c 
-m'  - 


■k5mWT  !  bv(t) 


6x(t) 


(8.2-11) 


where  B  *  is  a  diagonal  matrix  whose  diagonal  elements  are  the  gains  0/. 

1 

Proceeding  in  exactly  the  same  way  used  to  derive  the  error  expression  in 
Eq.  (4. 2-48),  one  obtains 


E(s)  = 


0'  v^  +  23  0f  x^  G  (s)  E(s)^ 

k  i=l  1  mi  ow  '  7s 


(8.2-12) 


where 


G  (s)  =  cT  (Is  -  A  +  btJ )  b 


(8.2-13) 
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and  v(t)  and  xm(t)  are  considered  constants.  Recall  that  Gq(s)  is  just  the 
transfer  function  between  the  output  y(t)  and  the  modified  command  input 
k(t)  v(t)  with  the  nominal  feedback  gains  1^  implemented  in  the  controller., 

Equation  (8.2-12)  provides  a  means  for  analyzing  the  stability 
properties  of  the  gradient -adaptive  controller.  To  improve  its  converg¬ 
ence  properties,  we  apply  the  method  of  Section  4.3  which  introduces 
linear  compensation,  Gc^(s)  and  GC2(S)>  into  the  adaptive  loop  to  obtain  an 
accelerated  gradient  algorithm.  This  algorithm  is  the  same  as  Eq.  (4. 3-5) 
with  the  additions 

P^t)  -  ef(t)  v(t)  Wk(s)  =  Gc  (s)  Pk(s) 

2 

k(t)  *  -(3jwk(t)  (8.2-14) 

where  Pk(s)  and  Wfe(s)  denote  Laplace  transforms  of  pfc(t)  and  wk(t).  The 
new  error  equation  has  the  form 


[G  (s)  G  (b)  G  (s) 

O  c2 

S~~ 

. 


E(s) 


9  H  2 

+  E 

K  i=l  1  i 


(8.2-15) 


The  above  discussion  is  largely  a  brief  review  of  material  dis¬ 
cussed  in  Chapter  4  and  provides  the  framework  for  the  particular  appli¬ 
cation  described  in  the  next  three  sections. 
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8.2.2  A  Pitch  Rate  Autopilot 

The  first  application  we  consider  is  that  of  a  pitch  rate  autopilot 
for  the  missile  dynamics  given  in  Appendix  H,  Section  H.l.  For  this  case 
we  have 

cT  =  [1  0  0]  (8.2-16) 


in  Eq.  (8.2-9). 

Before  specifying  the  detailed  construction  of  the  controller,  the 
value  of  N  in  Eq.  (8.2-5)  must  first  be  determined.  Referring  to  Eqs. 
(8.1-3),  (8.1-4),  and  (8.2-16)  it  is  evident  that 

N  =  1:  cTb  =  0 

N  =  2:  cT  Ab  =  b2al2  +  b3al3  (8.2-17) 

Consequently  N  =  2  and  from  the  data  in  Table  H.  3  it  follows  that 

cT  Ab  <  0 

for  all  flight  conditions.  Therefore  Eqs.  (8.2-5)  and  (8.2-8)  imply  that 


n -  -fli; 

(3.  >  0 

Ki 

o 

A 

erf* 

(8.2-18) 

Combining  these  results  with  Eqs.  (8.2-9),  (4.3-5),  (8.2-14),  and 
(8.2-16)  one  obtains  the  following  set  of  expressions  for  a  pitch  rate 
autopilot: 
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Accelerated  Gradient  Adaptive  Controller 
for  a  Pitch  Rate  Autopilot 


X(t)  = 

(A  -  bh(t)^)  x(t) 

+  b  k(t)  v(t) 

A  x  (t)  +  b  v(t) 
m  — m  —  m 

e(t)  = 

Xi(t)  -  x  (t) 
l  m  j 

Ef(s)  = 

G  (s)  E(s) 
C1 

P^t)  = 

ef(t)  x.(t) 

Pk(t)  = 

ef(t)  v(t) 

W.(s)  = 

G  (s)  P  (s) 
c2  1 

wk(s)  - 

Gc  (s)  PkM 
2 

hj(t)  = 

-  gy*  w.(t) 

k(t)  = 

gykwk(t) 

where  i  =  1, . . . ,  n  and  we  have  made  the  definitions 

yi  "  Bi/e  >  0 

yk  %/g  >  0  (8.2-20) 


The  positive  quantity  g  is  introduced  to  provide  convenient  control  over  the 
adaptive  loop  gain  k  derived  from  Eqs.  (8.2-15),  (8.2-18),  and  (8.2-20); 


ke  =  -g 


2  A  2 

y.v  +  h  v.  x 
k  'l  mt 


<  0 


(8.2-21) 


At  this  stage,  the  designer’s  task  is  divided  into  four  well  defined  phases: 

•  Determine  the  required  adaptive  loop  compensation. 

•  Select  appropriate  specifications  for  the  reference 
model. 

•  Determine  the  adaptation  gains  ^  and  j3^,  i  =  1, . .  ,n. 

•  Evaluate  performance  with  simulations. 
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These  steps  are  completed  below,  for  the  specific  example  represented 
by  the  air-to-surface  trajectory  data  in  Section  H.  1. 

Adaptive -Loop  Compensation  —  To  determine  Gc^(s)  and  GC2(s), 
examine  the  root  locus  of  the  denominator  in  the  transfer  function 


Te(s)  - 


G  (s)  =  G  (s)  G  (s)  (8.2-22) 

c  c1  c2 


associated  with  Eq„  (8.2-15).  Recall  that  Gq(s)  as  given  by  Eq.  (8. 2-13) 
is  assumed  to  have  poles  identical  with  those  of  the  model  (see  Eq.  (4. 2-43)). 
Its  zeros  are  determined  by  evaluating  the  numerator  in  Eq.  (8. 2-13). 
Defining  the  elements  of  A  and  b  as  in  Eq.  (8. 1-4),  Gq(s)  is  obtained  in  the 
form 


go  =  “  (a33  al3  +  al2  a23^  z0  =  *'  al3  a33  a22/go  (8*2"23) 


where  Pm^,  pm2>  pm3  are  specked  reference  model  poles. 

Evidently  the  gain  gQ  and  zero  zq  of  Gq(s)  both  depend  upon  the 
variable  plant  parameters  and  are  independent  of  the  feedback  gains  hQ. 
Reference  to  Table  H.  3  indicates  that  gQ  and  zq  are  negative  for  all  flight 
conditions.  Substitution  of  Eq.  (8. 2-23)  into  Eq.  (8. 2-22)  produces 


Te(s) 


s  + 


k  g  (s-z  )  G  (s) 
e  o  o  cv  ' 


s-p 


m. 


s-p 


m. 


s-p, 


m. 


(8.2-24) 
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where  the  product  (k  g  )  is  always  positive  if  y.  and  y.  in  Eq.  (8.2-21)  are 

6  0  1  K 

positive. 


The  reference  model  poles  are  chosen  so  that  desirable  response 
characteristics  are  achieved;  for  this  investigation 

pm  =-40.0  u  =  -5.0  +j  5.0  pm  = -5.0 -j  5.0  (8.2-25) 

ml  m2  m3 

providing  a  time  constant  on  the  order  of  0. 2  sec.  Evaluation  of  z^  and  gQ 
for  representative  flight  conditions  indicates  that  their  ranges  are 
approximately 

300  -  |gQ|  "  15,000 

0.04  -  jzQ|  “  1.25  (8.2-26) 

Taking  G  (s)  =  1  —  corresponding  to  the  gradient  controller  derived  in 
c 

Eq.  (8. 2-9)  —  the  locus  of  poles  of  T  (s)  is  shown  qualitatively  in  Fig. 

C 

8.2-1.  As  k  increases  in  magnitude,  the  complex  poles  move  toward  the 
6 

jco  axis,  eventually  crossing  into  the  right-half-plane  causing  the  system 
to  become  unstable.  The  pole  at  the  origin  moves  toward  the  zero  of  G0(s). 

To  improve  the  adaptation  speed  of  the  system,  compensation 
should  be  introduced  so  that  the  real  parts  of  the  complex  poles  become 
more  negative  as  the  loop  gain  is  increased.  This  can  be  most  easily 
accomplished  by  specifying 

G  (s)  =  s-z  z  <  0  (8.2-27) 

c  c  c 

For  this  example,  we  take  z  =  -6.0  sec-1  and  compare  the  resulting  root 

c 

locus  in  Fig.  8.2-2  with  that  in  Fig.  8.2-1.  Evidently  much  better 
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damping  of  the  complex  poles  can  lie  achieved  as  the  adaptive  loop  gain  is 
increased. 


In  mechanizing  Eq.  (8.2-27)  note  that  G  (s)  is  not  realizable. 

c 

This  difficulty  is  circumvented  by  taking  Gc^(s)  in  Eq.  (8.2-22)  equal  to 
one  and 


Gc2(s)  =  s'zc 

Thus  from  Eq.  (8.2-19)  we  obtain 


G  (a)  P.(s) 

Hi<s)  -  -gy,  -S-y - 

Gc  (s)  Pk(s) 

K(s)  =  gyk  — ^ -  (8.2-28) 


where  H.(s)  and  K(s)  are  the  Laplace  transforms  of  h.(t)  and  k(t).  In  this 
manner  one  is  required  only  to  mechroiize  a  realizable  transfer  function 


GC  (B) 
c2 
s 


s  -z 
_ c 

s 


Recall  that  the  same  technique  is  employed  in  Example  4. 3-1. 

From  the  above  discussion  we  now  know  the  structure  of  the 
adaptive  control  system  to  within  specific  values  for  the  adaptation  gains 
and  some  additional  parameters  which  define  the  reference  model.  A 
block  diagram  of  the  system  is  shown  in  Fig.  8.2-3.  For  simulation 
purposes  switches  S  are  included  in  the  adaptive  loop  which  can  be  opened 
to  eliminate  the  integrator  by-pass  associated  with  the  accelerated 
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Figure  8. 2-3  An  Accelerated  Gradient  Controller 
for  a  Pitch  Rate  Autopilot 


gradient  algorithm.  This  permits  us  to  simulate  the  system  behavior  both 
with  and  without  adaptive  loop  compensation  to  provide  a  comparison  be¬ 
tween  conventional  gradient  and  accelerated  gradient  designs.  Of  course 
the  switches  would  not  be  present  in  an  actual  system  mechanization. 

Reference  Model  Specifications  —  The  fixed  parameters  for  the 
reference  model  are  chosen  to  be  identical  to  those  in  the  actual  autopilot 
at  flight  condition  6  (see  Section  H.l)  when  feedback  gains,  h  ,  are 
selected  so  that  the  system  poles  match  those  specified  in  Eq.  (8.2-25)  and 
km  is  chosen  so  that  the  d-c  gain  between  input  and  output  is  unity.  The 
rationale  for  this  selection  is  that  the  model  should  be  one  which  not  only 
has  satisfactory  response  characteristics  but  also  is  compatible  with  the 
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actual  airframe.  Thus  the  elements  of  A  and  b  are  determined  by 

m  — m 

requiring 


A6  "  -6-m 


k  bR 
m  —  b 


(8.2-29) 


where  Ag  and  bg  are  the  values  of  A  and  b  at  flight  condition  6  (see  Table 

H.3).  The  quantities  h  and  k  are  determined  from  the  relations 

— m  m 


Det 


Is"  A6+-6-m 


(s+40)  (s+5+j5)  (s+5-j5) 


lim  - : - 

s-°  cT[ls-A6+b6hTm]'  b6 


(8.2-30) 


The  result  of  carrying  out  the  above  calculations  is 


-1.63 

-1.42  x  10"2 

-2.93  x  10! 

3. 51  x  1 03 

-1.26 

-3. 18  x  10‘ 

0.929 

-3.45 x  10“4 

-47.1 

bT  =[  0.0  -4.47  x  103  -6.61]  (8.2-31) 

In  addition  some  method  is  required  for  specifying  the  initial 
values  of  the  adaptive  gains,  h(to)  and  k(t0).  This  is  accomplished  for  each 
simulation  by  assuming  the  initial  flight  condition  (i.e. ,  the  initial  values, 
Aq  and  b^,  of  the  airframe  parameters)  is  known,  matching  the  closed  loop 
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system  poles  to  those  of  the  model  and  requiring  the  d-c  gain  to  be  unity. 
Following  Eq.  (8.2-30),  h(t0)  and  k(t0)  are  obtained  from 

Det  [*Is-Ao+boh(to)T  1  =  (s+40)  (s+5+j5)  (s+5-j5) 


(8.2-32) 


Determine  Adaptation  Gains  —  The  adaptation  gains  0.  =  gyi  and 
=  gyk  must  be  selected  to  obtain  satisfactory  adaptation  properties. 

Recall  the  discussion  in  Section  4.2.6  suggesting  that  they  be  chosen  to 
contribute  equally  to  the  adaptive  loop  gain  k  .  Another  important  con- 
sideration  is  that  the  resulting  variations  in  the  gains  h.(t)  and  k(t)  as  adap¬ 
tation  proceeds  should  be  '  reasonable."  A  certain  amount  of  personal 
judgement  is  involved  in  this  part  of  the  design  process  and  some  trial  and 
error  experimentation  seems  unavoidable.  For  this  example  suitable  values 
of  the  quantities  y.  and  y,  are 

1  K 

yj  -  1.0  xlO4  y2  =  1.0 

y3  =  5.0  xlO2  yk  =  2.0  x  106  (8.2-33) 

which  size  the  adaptation  gains  relative  to  one  another.  Simulation  results 
are  presented  for  various  values  of  the  parameter  g  which  adjusts  the  level 
of  the  adaptive  loop  gain. 

Simulation  Results  ana  Evaluation  —  The  first  thing  to  be  demon¬ 
strated  is  the  improvement  in  adaptation  time  achieved  through  the  use  of 
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linear  compensation  in  the  adaptive  loop.  To  obtain  a  system  configuration 

equivalent  to  the  gradient -adaptive  controller  described  by  Eq.  (8.2-9) 

which  has  no  adaptive  loop  compensation,  the  switches  S  in  Fig.  8. 2-3  are 

opened  and  z  is  set  equal  to  -1.0.  A  set  of  system  simulations  is  per- 
c 

formed  beginning  at  flight  condition  4,  t  =  23.0  secs. ,  letting  the  system 
parameters  vary  linearly  as  prescribed  in  Eq.  (H-2).  The  system  output 
to  a  step  input  command,  v,  of  .026  rad/sec.  is  observed  for  an  interval 
of  one  second.  The  outputs  of  both  the  model  and  system  are  shown  in 
Fig.  8. 2-4  for  three  different  values  of  g.  Notice  that  increasing  the  gain 
does  little  to  reduce  the  error  over  the  one  second  interval  but  the  oscilla¬ 
tion  frequency  of  the  adaptive  loop  noticeably  increases.  The  latter  be¬ 
havior  is  qualitatively  predicted  by  the  root  locus  in  Fig.  8.2-1. 

Now  we  assess  the  effect  of  closing  the  switch  S  in  Fig.  8.2-3, 
which  mechanizes  the  accelerated  gradient  controller.  For  this  case  three 
one-second  runs  were  performed  for  various  values  of  g  under  the  same 
conditions  described  in  the  preceding  paragraph  except  that  the  compensa¬ 
tion  zero  is  located  at  -6.0.  The  corresponding  pitch  rate  curves  arc 
shown  in  Fig.  8.2-5.  Evidently  considerable  improvement  is  obtained  in 
adaptation  speed  when  the  adaptive  loop  gain  is  increased,  as  predicted  in 
Fig.  8. 2-2.  There  is  a  slight  tendency  toward  high  frequency  oscillations 
as  g  gets  larger  because  the  complex  poles  in  Fig.  8.2-2  asymptotically 
approach  a  line  parallel  to  the  jw  axis.  The  accelerated  gradient  controller 
performance  is  evidently  superior  to  that  exhibited  in  Fig.  8. 2-4  and  seems 
to  be  well  suited  for  missile  applications  where  rapidly  changing  missile 
parameters  require  a  rapidly  adapting  autopilot. 

Next  we  demonstrate  the  effect  of  variations  in  input  signal  level 
upon  adaptation  characteristics.  From  Eq.  (8.2-21)  one  expects  the  adap¬ 
tation  time  to  increase  as  v(t)  is  reduced  in  magnitude  because  the  adaptive 
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Figure  8.2-5  Step  Responses  of  an  Accelerated  Gradient  Adaptive 
Controller  for  a  Pitch  Rate  Autopilot 
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loop  gain  kg  also  decreases.  This  behavior  is  shown  in  Fig.  8.2-6.  First 
a  simulation  is  made  with 


|v(t)j  =  3.4x10  ^radian/sec  z  =  -6.0 

c 

g  =  1.5xl0“3 

The  corresponding  pitch  rates  for  the  reference  model  and  the  system  are 
shown  in  Fig.  8. 2-6(a)  over  a  two  second  period  beginning  at  flight  con¬ 
dition  4.  The  algebraic  sign  of  v(t)  is  switched  at  t  =  24.0  to  represent 
changing  input  commands  at  one  second  intervals.  Note  that  almost  no 

adaptation  takes  place.  When  the  input  command  level  is  increased  to 
-2 

10  rad/sec  the  adaptive  loop  gain  in  Eq.  (8. 2-21)  increases  and  much 
better  adaptation  is  observed  in  Fig.  8.2-6(b).  In  tactical  missiles  it  is 
expected  that  a  steering  command  will  be  present  most  of  the  time  while  a 
target  is  being  pursued  but  its  amplitude  may  fluctuate.  To  counteract  this 
effect,  the  gain  g  could  be  adjusted  in  response  to  the  measured  average 
value  of  the  steering  command.  For  example,  let 


g(t)  = 


(8.2-34) 


where  r  is  an  interval  somewhat  longer  than  the  model  settling  time  and 
k  is  a  proportionality  constant.  The  mechanization  of  Eq.  (8.2-34)  is 
known  as  a  signal-adaptive  technique  because  it  adapts  to  changes  in  input 
signal  characteristics.  Thus  there  can  be  a  need  to  make  adaptive  changes 
in  the  adaptive  loop  gain. 

Another  important  fact  to  be  illustrated  about  adaptive  systems 
is  that  good  adaptation  characteristics  for  one  state  variable  (e.  g„,  pitch 
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Figure  8. 2-6  Pitch  Rate  Responses  for  an  Adaptive  Pitch  Rate 
Autopilot  With  Different  Levels  of  Input  Commands 

rate)  do  not  necessarily  imply  good  adaptation  for  other  state  variables 
(e.  g. ,  normal  acceleration).  This  point  has  been  emphasized  in  Section 
8. 1. 2.  Suppose  the  designer  is  actually  interested  in  the  missile’s  nor¬ 
mal  acceleration  response  and  desires  that  the  latter  should  follow  the 
reference  model  acceleration  output  for  the  parameters  in  Eq.  (8.2-31). 

If  he  tries  to  accomplish  this  goal  by  designing  an  adaptive  pitch  rate  auto¬ 
pilot,  the  resulting  normal  acceleration  response  may  exhibit  exactly  the 
wrong  behavior.  This  claim  is  verified  by  examining  time  histories  of 
normal  acceleration  corresponding  to  the  simulations  performed  for 
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Fig.  8.2-6.  In  Fig.  8.2-7(a)  curves  of  normal  acceleration  for  both  the 
reference  model  and  the  system  are  shown  which  are  taken  from  the  simu¬ 
lation  performed  for  Fig.  8. 2-6(a).  Recall  that  very  little  adaptation  is 
achieved  for  pitch  rate  in  this  case.  However,  the  acceleration  responses 
are  reasonably  close  together,  indicating  good  autopilot  performance  with 
respect  to  normal  acceleration.  Figure  8. 2-7(b)  shows  acceleration 
response  curves  for  the  case  given  in  Fig.  8. 2 -6(b).  Now  good  adaptation 
is  obtained  in  pitch  rate  but  the  system’s  acceleration  response  relative  to 


TIME  j»«c) 


(b)  |  «(l  )|>  1.0  *  Id"2  rod /sec 

Figure  8.2-7  Normal  Acceleration  Responses  for  an  Adaptive  Pitch 

Rate  Autopilot  W’th  Different  Levels  of  Input  Commands 
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the  model’s  is  poor.  The  physical  reasons  for  this  behavior  are  discussed 
in  Section  8.1.2;  it  is  characteristic  of  aerodynamically  controlled  mis¬ 
siles. 


The  performance  of  any  adaptive  system  using  implicit  plant 
identification  should  also  be  evaluated  by  simulating  a  trajectory  which 
passes  through  several  different  flight  conditions.  This  will  test  the  ability 
of  the  adaptation  algorithm  to  adjust  the  adaptive  gains  so  that  the  system 
output  follows  the  model  output  as  the  plant  parameters  vary  with  time. 

For  this  purpose  the  air -to -surface  missile  trajectory  was  simulated  from 
flight  condition  2  at  t  =  6  seconds  through  flight  condition  4  to  t  =  26  seconds 
with  linearly  varying  plant  parameters.  Because  the  missile  is  thrusting 
during  the  first  part  of  this  segment  of  the  trajectory,  plant  parameters 
vary  over  a  wide  range,  as  indicated  in  Tables  H.l  to  H.4,  and  the  air¬ 
frame  is  actually  exposed  to  a  continuum  of  changing  flight  conditions.  Note 
particularly  that  dynamic  pressure  changes  by  a  factor  of  four  in  the  first 
two  seconds.  Pitch  rate  responses  are  exhibited  in  Fig.  8.2-8  for  two 
representative  sets  of  three  second  intervals,  one  at  the  beginning  and  the 
other  at  the  end  of  the  time  period  considered.  The  input  command  v(t)  is 
a  piecewise  constant  signal  which  changes  its  level  at  one  second  intervals, 
as  indicated.  These  results  indicate  the  ability  of  the  accelerated  gradient 
technique  to  rapidly  adapt  to  the  reference  model  and  maintain  a  small  out¬ 
put  error  for  a  wide  variation  in  the  missile  dynamic  characteristics. 

Finally,  it  is  important  to  point  out  potential  disadvantages  of 
this  method.  The  principal  defect  of  the  accelerated  gradient  technique 
(and  any  gradient  method)  is  that  satisfactory  behavior  cannot  be  guaran¬ 
teed.  The  root  locus  in  Fig.  8. 2-2  is  an  indication  of  local  stability 
properties  only;  nothing  can  be  said  about  global  behavior.  Indeed,  if  plant 
parameters  vary  too  rapidly  and  over  too  great  a  range,  the  validity  of  the 
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Figure  8.2-8  Pitch  Rate  Response  With  Time -Varying 
Airframe  Parameters 

gradient  arguments  used  in  designing  the  controller  becomes  questionable 
and  system  instability  may  result.  To  illustrate  this  fact  an  autopilot 
simulation  was  conducted  with  the  autopilot  in  Fig.  8.2-3  over  the  first  six 
seconds  of  the  trajectory  summarized  in  Table  H.  1.  From  zero  to  five 
seconds  the  airframe  parameters  are  constant  with  a  relatively  low 
dynamic  pressure.  At  time  t  =  5.0  secs.,  the  engine  ignites  and  delivers  a 
thrust  of  about  25  g’s,  causing  rapid  parameter  variations.  The  resulting 
behavior  of  the  adaptive  control  system  is  illustrated  in  Fig.  8. 2-9 „  For 
the  first  5  seconds  satisfactory  adaptive  operation  is  observed,  but  shortly 
after  the  engine  ignites  the  system  becomes  unstable.  This  simply  em¬ 
phasizes  the  fact  that  the  full  operating  regime  and  different  controller 
designs  (e.g.,  different  forms  of  G  (s)  in  Eq.  (8.2-22))  must  be  considered 

V 

when  testing  an  adaptive  technique  of  this  type.  For  example,  a 
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Figure  8.2-9  Pitch  Rate  Responses  With  Rapid  Airframe 
Parameter  Variations  That  Lead  to 
Autopilot  Instability 


modification  to  the  design  in  Fig.  8.2-3  that  might  improve  the  system 
stability  characteristics  is  the  addition  of  an  adaptive  filter  in  the  control 
loop  having  a  set  of  adaptive  gains  g.  The  filter  configuration  would  be 
chosen  such  that  for  each  possible  set  of  values  of  the  airframe  parameters 
the  autopilot  input-output  transfer  function  can  be  made  identical  to  that  of 
the  model  for  some  choice  of  gains  g,  h,  and  k.  Recall  that  the  set  of 
gains  —  h  and  k  —  provide  control  over  the  autopilot  closed  loop  poles  and 
the  steady-state  gain  but  not  over  the  closed  loop  zeros.  The  investigation 
of  this  possibility  should  be  the  subject  of  future  study. 
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The  pitch  rate  autopilot  described  here  serves  mainly  to  demon¬ 
strate  capabilities  and  limitations  of  the  accelerated  gradient  adaptive  con¬ 
trol  technique,  especially  its  improvement  over  conventional  gradient 
methods.  With  respect  to  missile  pitch  rate  control  we  make  the  following 
conclusions  on  the  basis  of  these  simulation  results: 

•  The  accelerated  gradient  technique  has  the  potential 
to  achieve  rapid  adaptation  in  the  presence  of  a 
time -varying  plant. 

•  Extremely  rapid  parameter  variations,  such  as  are 
encountered  while  thrusting,  can  cause  instability 
of  gradient -type  methods. 

•  Adaptive  pitch  rate  control  does  not  provide  good 
normal  acceleration  response  for  an  aerodynamically 
controlled  missile  because  the  dynamic  relationship 
between  pitch  rate  and  normal  acceleration  varies 
with  flight  condition. 

Evidently  the  autopilot  design  in  Fig.  8.2-3,  based  on 
forcing  missile  pitch  rate  to  follow  that  of  a  fixed  reference  model, 
is  not  practical  for  adaptively  controlling  the  normal  acceleration  of 
missile  airframes  which  depend  upon  aerodynamic  lift  to  develop 
control  forces.  We  shall  see  in  Section  8.2.4  that  this  conclusion 
can  be  reversed  if  an  adaptive  (time -varying)  reference  model  is 
permitted. 


8.2.3  A  Normal  Acceleration  Autopilot 

In  this  section  we  apply  the  accelerated  gradient  method  to  the 
problem  of  designing  a  normal  acceleration  autopilot  for  the  airframe  dais 
given  in  Appendix  H.  The  rationale  for  this  objective  is  that  the  adaptation 
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algorithm  should  be  based  upon  an  error  signal  which  directly  reflects 
the  difference  between  the  actual  and  the  desired  autopilot  perform¬ 
ance. 


Proceeding  in  the  same  manner  used  in  Section  8.2.2  we  first 
determine  the  value  of  N  in  Eq.  (8.2-5),  noting  that 

cT  =  [0  1  0].  (8.2-35) 

Referring  to  Eqs.  (8.1-3),  (8.1-4),  and  (8. 2-35)  it  is  determined  that 
when  N  is  one, 


T 

£  £  *  b2 

Consequently  N  =  1  and  from  Table  H.3  it  follows  that  bg  >  0  for 
all  flight  conditions.  Therefore  from  Eqs.  (8.2-5)  and  (8.2-8)  it 
follows  that 


0/  *  Hi  =  0,  >  0 

0j  =  ^  =  gyk  ^  >  0  (8.2-36) 


Combining  these  results  with  Eqs.  (8.2-9),  (4.3-5),  (8.2-15),  and  (8.2-35) 
one  obtains  the  following  equations: 
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Accelerated  Gradient  Adaptive  Controller 
for  a  Normal  Acceleration  Autopilot 


x(t)  =  (A-bh(tr )  x(t)  +  bk(t)  v(t) 


x  (t) 
-m 


Ef(s) 


Pi?) 


Pk(«  = 


W.(s)  = 


Wk(S)  = 
h.(t)  = 
k(t)  = 


A  x  (t)  +  b  v(t) 
m-m  -m 


x2(t)  '  xm2<t> 

Gc,(s)  E(s) 


ef(t)x.(t);  i  =  1, . .  ,n 
ef(t)  v(t) 

G_  (s)P.(s);  i  =  1, . .  ,n 


Gc2(s)  Pk(s) 
gy^t);  i  =  l,..,n 

-  ^kwk(t) 


(8.2-37) 


The  quantity  g  is  defined  as  in  Eq.  (8.2-21)  to  provide  control 
over  the  adaptive  loop  gain,  ke,  given  by  Eqs.  (8.2-15)  and  (8.2-36)  as 


k  =  g  y,  v  4  V'  y.  x  z  >0 
e  ^  k  m. 

i  =  l  i 


(8.2-38) 


where  g,  y, >  and  y.  are  all  positive  constants.  Now  we  proceed  to  complete 

K  1 

the  four  design  steps  described  in  Section  8.2.2  --  determine  the  adaptive 
loop  compensation,  select  the  reference  model,  determine  adaptation  gains, 
and  evaluate  performance  -  -using  the  airframe  data  given  in  Section  H.  1. 
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Adaptive  Loop  Compensation  —  To  determine  GCj(s)  and  GC2(s), 
examine  the  root  locus  of  the  denominator  in  the  transfer  function 


G  (s)  =  G  (s)  G  (s)  (8.2-39) 

c  Cj  c2 

associated  with  the  error  equation  in  Eq.  (8.2-15).  Recall  that  G  (s)  as 
given  by  Eq.  (8. 2-13)  is  assumed  to  have  poles  identical  with  those  of  the 
model  (see  Eq.  (4.2-43));  its  zeros  are  determined  by  evaluating  the 
numerator  in  Eq.  (8.2-13).  Using  A  and  b  as  defined  in  Eq.  (8. 1-4),  Gq(s) 
is  obtained  in  the  form 

b  =  a21a13a33 
0  a23 

a  =  -at1  (8.2-40) 
o  ll 

where  pm^,  pm2>  and  pmg  are  the  specified  reference  model  poles.  The 
gain  gQ  and  polynomial  coefficients,  aQ  and  bQ,  all  depend  upon  the  variable 
plant  parameters,  and  are  independent  of  the  feedback  gains  Table  H.  3 
indicates  that 

g  >  0,  a„  >  0,  b<  0 
eo  *  O  0 

and  furthermore 

Ib0l  »  laj 
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Consequently  the  numerator  of  Gq(s)  has  two  zeros,  and  with  values 
given  approximately  by 


zr  z2  -  1 

and  Te(s)  in  Eq.  (8. 2-39)  becomes 

Te^  keg0(r-z1)(s-Z2)  “ 

(s  'Pmj)(s'p“2)(s  "p“3) 


(8.2-41) 


(8.2-42) 


The  reference  model  poles  are  assi  gned  the  same  numerical 
values  as  in  Eq.  (8.2-25),  i.e., 


p  =  -5.0  +  j  5.0  p  =  -5.0 -j  5.0 


Evaluation  of  gQ  and  bQ  for  representative  flight  conditions  indicates  that 
the  ranges  of  the  transfer  function  gain  and  zeros  are  approximately 

1000  <  g  <  41000 
bo 

2.7  <  Jzjl,  \z2\  <  120 

Taking  G  (s)  =  1  --  corresponding  to  the  gradient  controller 

V 

derived  in  Eq.  (8.2-9)  —  the  first  difficulty  with  the  gradient  method  is 
encountered.  Because  of  the  right -half-plane  zero  in  the  numerator  of 
Gq(s)  (i.e.,  Gq(s)  is  a  nonminimum  phase  transfer  function),  the  locus  of 
poles  of  Te(s)  in  Eq.  (8.2-42)  for  kg  positive  and  gQ  fixed  has  one  branch 
entirely  in  the  right -half-plane,  as  indicated  in  Fig.  8.2-10.  Because  the 
algebraic  sign  of  k  is  specified  to  be  positive  in  Eq.  (8.2-38)  as  a  result 

V 


8-41 


THE  ANALYTIC  SCIENCES  CORPORATION 


$  PLANE 


*-20t» 


Figure  8. 2-10  Root  Locus  of  the  Closed  Loop  Adaptive  Control 

System  for  Positive  kg  With  No  Adaptive  Loop 
Compensation  (Gc(s)  =  1) 

of  the  design  procedure,  the  gradient-adaptive  controller  is  locally  unstable 
for  this  application.  This  is  clearly  an  unacceptable  operating  condition  and 
is  evidence  of  the  fact  that  the  gradient  concept  for  controller  design  is  not 
always  valid.  *  The  above  stability  problem  can  be  eliminated  by  assigning 


* 

This  behavior  is  attributable  to  the  fact  that  the  expansion  in  Eq.  (4. 2-24) 
used  to  derive  the  simplified  gradient  algorithm  provides  a  poor  approxi¬ 
mation  to  the  index  J  for  the  purpose  of  controlling  normal  acceleration. 

An  analysis  reveals  that  the  M.  I.T.  rule  described  in  Section  4. 2. 1  is 
also  locally  unstable  because,  in  this  particular  application,  the  con¬ 
tinuous  adaptation  algorithm  is  a  relatively  poor  approximation  to  the  ideal 
gradient  technique  expressed  by  Eq.(4.2-10).  The  possibility  of  the  latter 
effect  is  discussed  in  Appendix  C.  The  conclusion  is  that  neither  analog  gain 
adjustment  rule  described  in  Section  4.2  behaves  as  desired  for  this  appli¬ 
cation,  The  discrete  gain  adjustment  rule  discussed  in  Section  4.2.2  does 
not  suffer  from  either  of  the  above  noted  difficulties;  therefore  it  may  be 
a  better  method  for  this  application.  This  point  is  also  mentioned  at  the 
end  of  Section  4. 3. 
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a  negative  gain  to  the  compensating  transfer  function,  e.g.,  take  G  (s)  =  -1. 
However,  anew  complication  arises;  the  root  locus  shown  in  Fig.  8.2-11 
indicates  that  the  adaptive  system  becomes  unstable  for  sufficiently  high 
gain.  Consequently  in  order  to  improve  system  adaptation  time  as  we  did 
for  the  pitch  rate  autopilot,  more  thought  must  go  into  selecting  an  appro¬ 
priate  form  for  G  (s). 

V/ 


*-2089 


Figure  8.2-11  Root  Locus  of  the  Closed  Loop  Adaptive  System 

for  Positive  With  Negative  Gain  Compensation 
(Gc(s)  =  -1) 


The  feet  is  that  in  situations  like  that  described  above  no  form  of 

linear  adaptive  loop  compensation  can  function  to  permit  a  stable  system 

for  all  positive  values  of  k  .  The  right-half -plane  zero  will  always 

6 

"attract"  one  branch  of  the  locus  of  poles  of  T  (s).  After  some  experi- 

mentation  G  ,(s)  was  chosen  to  have  the  form 
0 


Gc(s)  = 


(8.2-43) 
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which  is  divided  between  Gc  (s)  and  GP  (s)  in  Eq.  (8.2-39)  as  follows: 


GC  (®) 
c2 


GC  (8)  = 

C1 


(8.2-44) 


The  structure  of  the  resulting  system  is  illustrated  in  Fig.  8.2-12. 


Specific  values  for  the  parameters  in  G(.(s)  are: 

z  =  -0.5 

C1 

p  =  -15.0  +  j  8.7 

C1 

z  =  -6.0  +  j  6.0 

C2 

p„  = -15.0 -j  8.7 
c2 

z  =  -6.0  -  j  6.0 

C3 

(8.2-45) 

The  root  locus  diagram  as  a  function  of  k  for  this  choice  of  compensation 

v 

is  shown  qualitatively  in  Fig.  8.2-13.  The  simulation  results  discussed 
below  indicate  that  the  restriction  on  the  range  of  variation  of  the  adaptive 
loop  gain  and  its  sensitivity  to  airframe  parameter  variations  prevents 
achieving  adaptation  characteristics  that  are  as  good  as  those  obtained  for 
pitch  rate  in  the  preceding  section. 

Reference  Model  Specifications  —  The  fixed  parameters  for  the 

reference  model  are  the  same  as  those  used  for  the  pitch  rate  autopilot. 

The  elements  of  A  and  b  are  given  by  Eq.  (8.2-31).  The  initial  values 
m  —  m 

of  the  adaptive  gains  are  the  solutions  to  Eq.  (8.2-32)  perturbed  by  10%  to 
provide  a  significant  error  signal. 
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Figure  8 . 2-12  An  Accelerated  Gradient  Controller 

for  a  Normal  Acceleration  Autopilot 


Selections  for  Adaptation  Gains  —  The  adaptation  gains  0.  =  gy. 
and  A  =  gy,  are  selected  by  the  same  procedure  described  for  the  pitch 

K  K 

rate  autopilot.  The  specific  values  used  in  this  example  are 


y^  -  7. 5  x  104 

y2  =  1'° 

y^  =  5,0  x  104 

\  ‘  °'5 

(8.2-46) 

Simulation  results  are  presented  for  various  values  of  the  parameter  g 
which  provides  control  of  the  adaptive  loop  gain. 

Simulation  Results  —  Graphs  of  normal  acceleration  output  for 
both  the  plant  and  the  reference  model  are  shown  in  Fig.  8.2-14  for  dif¬ 
ferent  values  of  the  gain  factor  g.  The  airframe  dynamics  are  constant 
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Figure  8. 2-13  Root  Locus  of  Te(s)  for  Positive  kg 

With  Gc(s)  given  by  Eq.  (8.2-43) 

and  correspond  to  flight  condition  4  in  Table  H.  1.  It  is  observed  that  a 

moderate  improvement  in  adaptation  properties  over  a  one -second  interval 

-8 

is  obtained  as  g  varies  from  0  (no  adaptation)  to  3.0  x  10  .  However  the 

results  are  not  so  dramatic  as  those  in  Fig.  8.2-5  for  the  pitch  rate  auto- 

.7 

pilot.  When  g  becomes  as  large  as  1 . 0  x  10  (Fig.  8. 2-14(d))  the  system 
is  unstable,  as  predicted  qualitatively  from  Fig.  8.2-13.  It  is  also  found 
that  the  best  value  of  the  gain  g  for  flight  condition  4  causes  the  system  to 
be  unstable  when  the  plant  dynamics  are  changed  to  flight  condition  6  in 
Table  H.l,  implying  that  careful  tuning  of  the  adaptive  loop  gain  is  required 
as  flight  conditions  vary  in  order  that  the  system  operates  satisfactorily. 
The  need  to  know  the  flight  condition  in  order  to  make  on-line  adjustments 
to  the  adaptation  algorithm  tends  to  negate  the  purpose  of  an  adaptive 
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controller  having  implicit  plant  identification;  namely,  it  is  desired  that 
explicit  identification  be  avoided. 


8.2.4  An  Adaptive  Reference  Model 

In  Sections  8.2.2  and  8.2.3  it  is  observed  that  satisfactory  adap¬ 
tive  normal  acceleration  response  is  difficult  to  achieve  with  the  accelerated 
gradient  method  for  the  particular  missile  application  we  are  considering. 
Good  pitch  rate  adaptation  characteristics  are  obtained  in  Section  8.2.2 
with  the  pitch  rate  autopilot;  however  the  corresponding  normal  accelera¬ 
tion  response  is  sensitive  to  the  missile’s  flight  condition.  The  latter  be¬ 
havior  is  explained  by  the  fact  that  the  relationship  between  the  transient 
dynamics  of  q(t)  and  a(t)  is  dependent  upon  the  values  of  the  airframe 
parameters.  Consequently  if  q(t)  is  forced  to  follow  the  output  of  a  fixed 
reference  model  at  all  flight  conditions,  the  transient  characteristics  of 
a(t)  must  necessarily  vary.  Alternatively  in  Section  8. 2. 3  an  adaptive 
normal  acceleration  autopilot  is  studied.  However,  the  nonminimum  phase 
character  of  the  airframe  input -output  transfer  function  causes  the  stability 
properties  of  the  control  system  to  be  quite  sensitive  to  flight  condition. 

To  achieve  better  normal  acceleration  response  from  the  pitch 
rate  autopilot,  one  can  ccnsider  using  an  adaptive  reference  model;  here¬ 
tofore  the  model  dynamics  have  been  time-invariant.  This  suggestion  is 
motivated  by  the  fact  that  at  each  flight  condition  the  adaptive  controller 
ideally  should  have  the  capability  to  make  the  compensated  airframe 
input-output  dynamics  identical  with  those  of  the  reference  model  for  some 
choice  of  the  adaptive  gains  h  and  k. .  We  have  already  noted  that  h  and  k 
alone  do  not  provide  enough  freedom  to  accomplish  this.  Therefore  an 
alternative  reference  model  is  advocated  which  is  obtained  by  continuously 
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identifying  enough  plant  parameters  and  simultaneously  changing  the 

corresponding  elements  in  A  and  b  so  that  the  associated  transfer 

m  -m 

functions  satisfy 

cT^Is-Am(a)^  bm(a)  =  cT|ls-A+bhT)  bk  (8.2-47) 

A 

for  some  values  of  k  and  h  at  each  flight  condition.  The  quantity  a  in 
Eq.  (8.2-47)  is  the  measurement  or  estimate  cf  tne  plant  parameters  re¬ 
quired  to  specify  the  reference  model. 

In  the  applications  we  have  been  considering,  where  full  state 
feedback  and  control  over  the  d-c  gain  is  assumed,  the  above  procedure 
is  equivalent  to  requiring  the  reference  model  and  missile  airframe  input- 
output  transfer  functions  to  have  the  same  zeros.  It  is  evident  from 
Eqs.  (8.1-10)  and  (8.1-11)  that  such  a  procedure  applied  to  a  pitch  rate 
autopilot  should  improve  normal  acceleration  response;  the  zero  of  Tm(s) 
will  be  adjusted  so  that  it  always  approximately  cancels  the  objectionable 
pole  in  A(s)/Q(s), 


System  Design  —  To  apply  the  above  idea  to  a  pitch  rate  auto¬ 
pilot,  recall  that  the  input-output  transfer  function  for  the  airframe  has 
the  form  (see  Eq.  (8.1-8)) 


Q(s)  _ 

m 


c(s-z) 


3  3 

s  +  a^s  +  agS  +a^ 


(8.2-48) 


Accordingly,  require  that  the  reference  model  input-output  transfer  func 
tion  be 


Qm(s) 


c  (s-z) 
m  ' 


3  2 

s+a  s+a  s+a 
ml  m2  m3 


(8.2-49) 
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where  z  is  an  estimate  of  z  and  c  satisfies 

m 


(8.2-50) 


The  coefficients  in  the  denominator  of  Eq.  (8. 2-49)  are  specified  by  the 
designer;  e.g., 


and  Eq.  (8.2-50)  provides  unity  d-c  gain  between  V(s)  and  Q^s), 

The  adaptive  controller  is  configured  almost  the  same  as  in 
Fig.  8.2-3.  One  exception  is  that  the  reference  model  dynamics  are  now 
given  by  Eq.  (8. 2-49)  which  requires  an  estimate  of  z.  Also,  since  we 
are  interested  in  commanding  pitch  rate  to  produce  a  desired  normal 
acceleration,  a  (t),  the  latter  must  be  scaled  using  the  steady  state  rela- 
tian  given  in  Eq.  (8.1-9);  i.e., 

v(t)  =  ac(t)/V 

where  V  is  an  estimate  of  the  missile’s  airspeed.  This  implies  that  air 
speed  must  also  be  identified.  Both  of  these  alterations  to  Fig.  8. 2-3  are 
indicated  in  Fig,  8. 2-15. 

Comparing  Eqs.  (8.2-48)  and  (8. 1-8)  it  follows  that 
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Figure  8. 2-15  An  Accelerated  Gradient  Controller  for  a  Pitch 

Rate  Autopilot  With  an  Adaptive  Reference  Model 

The  data  in  Table  H.3  indicate  that  the  denominator  in  Eq.  (8.2-50)  is 
dominated  by  the  term  ^^33);  therefore  z  -  a22*  In  other  words,  z  can 
be  approximately  identified  by  estimating  only  the  element  a22  in  Eqs. 
(8.1-3)  and  (8.1-4). 

The  subject  of  parameter  estimation  is  discussed  in  Chapters  5 
and  6  where  it  is  postulated  that  all  the  parameters  of  A  and  b  in  Eq.  (8. 1-4) 
are  to  be  identified.  When  that  capability  is  available,  the  control  methods 
described  in  Chapter  5  are  better  suited  for  autopilot  design.  However, 
here  we  are  suggesting  that  partial  plant  identification  can  be  beneficially 
applied  to  the  accelerated  gradient  method;  this  technique  may  offer  advan¬ 
tages  in  situations  where  it  is  easier  to  obtain  a  few  parameter  estimates 

A  A 

(z  and  V)  than  it  is  to  identify  all  the  elements  of  A  and  b. 
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Simulation  Results  —  The  adaptive  reference  model  prescribed 
by  Eqs.  (8.2-49)  and  (8,2-50)  was  simulated  for  the  accelerated  gradient 
controller  described  in  Section  8.2.2.  The  estimate  of  the  airframe  zero 
was  updated  continuously  according  to 


z(t)  =  a22(t) 

All  other  model  parameter  values  are  identical  to  those  used  for  the  simu¬ 
lation  associated  with  Figs.  8.2-6(b)  and  8. 2-7(b).  Recall  that  the  latter 
demonstrate  how  good  adaptation  in  pitch  rate  leads  to  poor  adaptation  in 
normal  acceleration  when  the  reference  model  dynamics  are  fixed. 

Initial  values  for  the  adaptive  gains  are  obtained  by  multiplying 
the  solutions  to  Eq.  (8.2-32)  with  a  scaling  factor  of  0. 70.  This  is  neces¬ 
sary  to  develop  a  significant  error  signal  when  the  adaptive  loop  is  dis¬ 
connected  (g  =  0  in  Fig.  8.2-15).  All  other  system  parameter  values  are 

the  same  as  those  used  for  Figs.  8.2-6(b)  and  8. 2-7(b).  Two  simulations 

-3 

were  performed;  one  with  g  =  0  (no  adaptation)  and  one  with  g  =  1. 5  x  10  , 
the  latter  being  the  same  value  used  for  the  above  referenced  figures. 

From  the  response  curves  displayed  in  Figs.  8.2-16  and  8.2-17,  it  is 
observed  that  the  adaptive  reference  model  enables  the  accelerated  gra¬ 
dient  technique  to  produce  the  desired  adaptive  response  characteristics 
in  both  pitch  rate  and  normal  acceleration. 

Conclusions  —  The  distinguishing  characteristic  of  the  method 
using  an  adaptive  reference  model  is  that  it  requires  identification  of  both 
a22  in  Eq.  (8. 1-4)  and  the  missile  airspeed,  V.  It  is  a  useful  technique 
for  modifying  the  structure  of  the  accelerated  gradient  controller  described 
in  previous  sections  so  that  desired  normal  acceleration  response  charac¬ 
teristics  can  be  obtained  for  the  tail-controlled  missile  in  Fig.  8.1-2. 
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(b)  NORMAL  ACCEIER.MION  RESPONSE 


Figure  8.2-17  Autopilot  Responses  for  a  System  Using  an  Adaptive 

Reference  Model:  Adaptive  Control  Loop  Connected 
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Although  estimates  of  only  two  parameters  are  required,  it  may  be  argued 
that  it  is  almost  as  easy  to  identify  ail  the  parameters  in  Eqs.  (8.1-4).  If 
so,  then  one  of  the  adaptive  control  methods  suggested  in  Chapter  5  may  be 
more  appropriate.  Further  investigation  of  this  question  should  be  pursued 
to  determine  whether  partial  plant  identification  of  the  type  advocated  above 
has  any  distinct  advantages  over  complete  identification. 


8. 3  APPLIC  \TION  OF  LIAPUNOV  DESIGN  TECHNIQUES 

In  Section  4.4. 3  a  Liapunov  synthesis  procedure  is  described 
which  is  suitable  for  adaptive  control  of  tactical  missiles  provided  the 
input -output  transfer  function  is  minimum  phase.  The  reason  for  this 
restriction  is  discussed  in  Section  4.4.4.  In  this  section,  the  method  is 
used  for  pitch  control  of  a  second  order  airframe  with  the  dynamics  of  the 
control  surface  actuator  neglected.  The  sequential  development  closely 
follows  Section  4.4,  3  to  provide  a  specific  illustration  of  the  steps  involved. 
The  discussion  evolves  in  three  steps  --  design  procedure,  selection  of 
parameters,  and  performance  evaluation.  Then  a  brief  discussion  is  given 
describing  how  the  adaptive  reference  model  concert  introduced  in  Section 
8.2.4  can  be  used  to  achieve  adaptive  control  of  normal  acceleration. 


8.3.1  Design  Procedure 

Let  the  input-output  relations  for  the  autopilot  design  problem  be 
represented  by  Fig.  8.3-1,  by  analogy  with  Fig.  4,4-3.  In  Laplace  trans¬ 
form  notation  the  equations  for  the  airframe  and  the  reference  model  are 
(s2  +  bs  +  a)  Y(s)  =  k(s  -  z)  U(s) 

(s2  +  bms  +  am)  Ym(s)  =  km(s  -  zm)  V(s)  (8.3-1) 
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ft -1917 


Figure  8. 3-1  Airframe  and  Reference  Model  Input-Output 
Relations  for  an  Adaptive  Control  System 


where  p^  is  a  negative  number  whose  allowed  range  of  values  is  to  be  deter¬ 
mined  presently.  The  result  after  carrying  out  the  required  number  of 
steps  in  dividing  by  p  (s)  is 

V 
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(s  "Vpi)E(s)  “ 


a  +pJb  +p«Y 

-S.  ^  m  - ll  E(s)  +  b  -b  Y(s) 
S-Pj  j  ni 

a  -a+p.fb  -bV 

— -  Y(s)  +  kU( 

s-px 

rz  km(Vzm)V(s> 


Y(s)  +  kU(s) 


prz 


U(s)  - 


k  V(s) 
m 


(8.3-3) 


which  has  the  same  form  as  Eqs.  (4.4-32)  and  (4.4-33).  Now  for  stability 
of  the  error  signal  we  require  that 


b  +p.  >  0 
m  1 


on  the  left-hand-side  of  Eq.  (8.3-3),  or 


P.  >  -b 
*1  m 


(8.3-4) 


This  provides  the  condition  needed  on  p  (s). 

0 


Referring  to  the  right-hand  side  of  Eq.  (8.3-3)  define  new 


variables 


Y  (s) 

-  Y(s) 

E  (s)  =  E(s) 

c 

S-Pj 

c  S-Pj 

U  (s) 

-  U(s) 

V  (s)  =  V<s> 

c 

S-Pj 

C  s-Pl 

(8.3-5) 
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and  vector  quantities 


b  -b 
m 

y(t) 

am"a+pi<bm-b} 

yc(‘) 

prz 

;  f(t)  = 

»c(t) 

km(zm-Pl) 

vc(t) 

"am"Pl.(bm+Pl) 

ec(t) 

Rewriting  Eq.  (8.3-3)  in  the  time  domain, with  the  aid  of  Eqs.  (8.3-5)  and 
(8.3-6),  produces  a  differential  equation  for  the  error  in  the  form  of 
Eq.  (4.4-42), 


6(t)  =  “(bm+Pi)  e(t)  +  pT  f(t)+ku(t)-km7(t) 


(8.3-7) 


The  adaptive  controller  is  derived  directly  from  Eq.  (4.4-43). 
Because  Eq.  (8. 3-7)  is  a  scalar  error  equation,  Q  and  ^  in  Eq.  (4. 4-43) 
are  also  scalars  to  which  we  can  assign  the  value  one.  Also,  since  A  is 
an  arbitrary  positive  definite  matrix  and  k  is  unknown,  choose  the  former 
to  be  diagonal  and  replace  the  latter  by  its  algebraic  sign.  The  application 
we  are  considering  is  the  same  as  in  Section  8. 2. 2  so  that  k  is  the  same  as 
g  in  Eq.  (8.2-23),  which  is  a  negative  quantity  for  all  flight  conditions. 
With  these  assumptions,  the  adaptation  algorithm  becomes 


u(t)  =  (-hc(t) 

T  +  oe(t) 

1 - 1 

s  s 

T 

A'1) 

v(t)_ 

h c(t)-  -  A'1 

■«t)- 

v(t) 

e(t) 

(8.3-8) 
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where  the  six  diagonal  elements  of  A  and  the  positive  scalar  a  remain  to 
be  selected. 

The  implementation  of  this  adaptive  system  is  illustrated  in 
Fig.  80  3-2.  To  provide  a  means  for  exercising  control  over  the  total 
adaptive  loop  gain,  define  the  i*h  element  of  A  to  be  XX.  where  the  quanti¬ 
ties  Xi  are  chosen  to  achieve  the  proper  relative  weighting  on  the  elements 
of  j:(t)  and  v(t)  in  Eq.(8.3-8)  and  X  is  a  scaling  factor.  The  adaptation  al¬ 
gorithm  is  computationally  more  complex  than  the  accelerated  gradient 
method  described  in  Section  8.2;  it  requires  two  additional  adaptive  gains 
and  mechanization  of  the  filters  with  transfer  function  l/pc(s). 

To  set  initial  values  for  the  feedback  gains  h  (t),  Eq.  (4. 4-47) 

c 

is  solved  for  the  initial  flight  condition  with  the  modification , 


The  quantity  r?  is  a  positive  factor  which  permits  the  adaptive  gains 
to  be  initially  "detuned”  from  their  optimum  values.  It  represents 
whatever  inaccuracy  may  exist  in  knowledge  of  the  initial  values  of  the 
plant  parameters. 


8.3.2  Selection  of  Parameters 

For  simulation  purposes  the  airframe  dynamics  are  taken  from 
the  trajectory  data  in  Section  H,1  with  the  actuator  dynamics  neglected.  The 
dynamics  of  the  reference  model  are  taken  from  flight  condition  6  in  Table 
H.3.  The  controller  gains  --  X .,  a  and  p^,  i  =  1, . . ,  6  —  are  assigned 
values 
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1/Xj  =  9.0  xlO2 

1/X2  =  3.0xl03 

1/X3  =  3.0x10° 

1/X4  =  2.34  x  104 

1/X5  =  1.8  xlO3 

1/X6  =  6.0  xlO4 

Pj  =  3.0 

a  =  0.167 

The  quantity  X  is  a  parameter  used  to  adjust  the  total  effective  loop  gain. 


8. 3. 3  Performance  Evaluation 


To  demonstrate  the  operation  of  the  system  illustrated  in  Fig. 

8. 3-2,  a  simulation  was  conducted  with  the  plant  parameters  fixed  at  the 
values  specified  by  flight  condition  2  in  Table  H.  1  and  an  adaptive  loop  gain 
X  *  =  5. 5.  The  initial  values  of  the  adaptive  gains  h^t)  were  computed 
from  Eq.  (8.3-9)  with  t)  =  C.  6  and  a  step  pitch  rate  command,  v  =  0.01 
radians/second  was  applied.  In  Fig.  8.3-3  the  reference  model  response, 
the  airframe  response,  hnd  the  associated  Liapunov  function  are  plotted 
as  a  function  of  time.  The  airframe  response  without  adaptation  (X*  =  0) 
is  also  shown  for  comparison.  Evidently  significant  improvement  is  pro¬ 
vided  by  the  adaptive  design.  Observe  that  the  Liapunov  function  decreases 
most  rapidly  initially,  when  tire  error  is  largest,  as  expected  from  Eq. 
(4.4-45).  It  should  be  noted  that  the  large  overshoot  in  the  reference  model 
response  is  required  in  order  to  rapidly  develop  normal  acceleration.  Thus 
a  pitch  rate  model  is  chosen  on  the  basis  of  the  desired  normal  acceleration 
response,  just  as  we  did  in  simulations  for  the  accelerated  gradient  method. 

To  demonstrate  the  performance  of  this  system  design  in  the 
presence  of  time-varying  system  dynamics,  the  trajectory  in  Section  H.l 
was  simulated  over  a  three  second  interval  beginning  at  t  =  7  seconds, 
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0  19*0-0 


Figure  8. 3-2  Liapunov  Design  Technique  Applied 
to  a  Second  Order  Airframe 


"half-way”  between  flight  conditions  2  and  3  in  Table  H.l.  This  includes 
a  portion  of  the  missile's  thrusting  period.  The  initial  values  of  the  adap¬ 
tive  gains  were  calculated  according  to  Eq.  (8. 3-9)  with  the  detuning  factor 
t)  equal  to  0.75.  The  response  curves  are  shown  in  Figs.  8. 3-4(a)  and  (b)0 
The  error  between  the  reference  model  output  and  the  airframe  pitch  rate 
is  plotted  explicitly  in  Fig,  8.3-4(b)  because  of  its  small  magnitude  rela¬ 
tive  to  the  pitch  rate.  Evidently  the  control  system  is  able  to  maintain  a 
very  small  error  level  relative  to  the  mode),  response;  the  performance  is 
somewhat  better  than  that  achieved  with  the  accelerated  gradient  method 
over  the  same  trajectory  (see  Fig,  8.2-8  for  comparison). 
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(b)  LIAPUNOV  FUNCTION 

Figure  8.3-3  Step  Response  of  Control  System  in 
Fig.  8.3-2  for  a  Constant  Plant, 
With  (X"l  =  5. 5)  and  Without 
(X“*  =  0)  Adaptation 
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(b|  PITCH  SATE  ERROR  SIGNAL 


Figure  8.3-4  Pitch  Rate  Response  of  Reference  Model  and  Pitch 
Rate  Error  Signal  for  the  Liapunov  Design  in 
Fig.  8.3-2 


In  Fig.  8.3-5  the  effect  of  setting  the  gain  a  to  zero  in  Eq. 
(8.3-8)  is  demonstrated;  this  is  similar  to  opening  the  switch  S  in  Fig. 

8. 2- 3  for  the  accelerated  gradient  method.  Recall  that  the  term  in  the  ex¬ 
pression  for  the  feedback  control  associated  with  this  gain  is  designed  to 
improve  the  adaptation  rate  of  the  system.  When  the  simulation  of  Fig. 

8. 3- 4  was  rerun  with  a  =  0,  the  resulting  error  signal  shown  in  Fig.  8.3-5 
was  significantly  larger  than  that  in  Fig.  8.  3-4(b)  indicating  that  a  /  0 
does  improve  the  system  adaptation  characteristics. 
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Figure  8.3-5  Pitch  Rate  Error  Signal  With  a  =  0 

The  controller  described  above  is  designed  for  a  second  order 
airframe,  neglecting  actuator  dynamics.  To  determine  the  effect  of  the 
actuator,  a  simulation  was  performed  using  the  same  controller  as  in 
Fig.  8. 3-2  with  a  first  order  lag  inserted  between  the  input  u(t)  and  the 
airframe.  The  time  constant  of  the  lag  is  0.02  seconds,  taken  from  the 
data  in  Section  H.  1.  The  trajectory  is  the  same  as  for  Figs.  8. 3-4  and 
8. 3-5  and  the  resulting  error  signal  is  shown  in  Fig.  8.3-6.  As  contrasted 
with  Fig.  8.3-4(b)  a  relatively  large  transient  error  is  incurred  each  time 
the  input  signal  v(t)  changes  sign.  However  its.duration  is  short  —  on  the 
order  of  0. 2  seconds  which  is  comparable  to  the  reference  model  response 
time. 


Finally,  we  wish  to  demonstrate  that  the  Liapunov  technique  is 
capable  of  providing  good  adaptive  performance  over  the  entire  thrusting 
period  of  the  trajectory  in  Section  H.  1.  The  simulation  was  performed 
for  a  three  second  interval  beginning  at  5  seconds  with  the  gain  detuning 
factor  r\  set  at  0. 9  in  Eq.  (8. 3-9).  The  pitch  rate  responses  of  both  the 
reference  model  and  the  airframe  are  given  in  Fig.  8.3-7.  Again  the  adap¬ 
tive  controller  succeeds  in  tracking  the  reference  model  output  very  well, 
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Figure  8. 3-6  Pitch  Rate  Error  Signal  With  Actuator  Lag 


Figure  8.3-7  Pitch  Rate  Response  for  Rapid  Airframe 
Parameter  Variations 


even  though  plant  parameters  are  varying  rapidly.  This  behavior  contrasts 
favorably  with  the  operation  of  the  accelerated  gradient  technique  in  Fig. 
8.2-9.  However  it  may  be  simply  a  fortuitous  circumstance  because  the 
theory  predicts  good  operation  only  when  plant  parameters  are  constant. 
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8.3.4  An  Adaptive  Reference  Model 

When  a  partial  plant  identification  capability  is  available,  good 
adaptive  normal  acceleration  response  can  probably  be  achieved  if  the 
Liapunov  design  technique  is  modified  to  include  an  adaptive  reference 
model,  just  as  described  for  the  accelerated  gradient  method  in  Section 
8.2.4.  The  use  of  an  adaptive  reference  model  forces  the  system  to 
respond  to  input  commands  in  such  a  fashion  that  the  airframe  acceleration 
output  has  the  desired  characteristics.  The  implementation  of  this  idea  is 
illustrated  by  the  conceptually  simply  modifications  to  Fig.  8.3-2  indicated 
in  Fig.  8.3-8.  Only  estimates  of  airspeed  and  the  actual  airframe  zero,  z, 
are  required.  No  simp’s4 ions  were  performed  for  this  design. 


Figure  8. 3-8  Liapunov  Design  Technique  Applied  to  a  Second  Order 
Airframe  With  an  Adaptive  Reference  Model 
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The  same  comments  made  for  the  accelerated  gradient  technique 
concerning  the  practicality  of  an  adaptive  reference  model  apply  to  the 
Liapunov  procedure.  If  it  is  deemed  easier  to  identify  only  a  few  airframe 
parameters  (z  and  V)  instead  of  the  entire  plant,  the  procedure  described 
above  may  offer  computational  advantages  over  the  methods  described  in 
Chapter  5  which  require  complete  plant  identification.  Further  investiga¬ 
tion  of  the  adaptive  reference  model  concept  is  needed  to  determine  its 
merits  vis  avis  complete  identification. 


8. 4  SUMMARY  AND  CONCLUSIONS 

8.4d  Summary 

The  Accelerated  Gradient  Method  —  The  accelerated  gradient 
method  is  applied  to  pitch  rate  and  normal  acceleration  autopilots  in 
Sections  8. 2. 2  and  8.2.3.  The  technique  is  characterized  by  a  time- 
in  variant  reference  model  and  no  plant  identification  capability.  The  prin 
cipal  observations  are  that  relatively  good  adaptive  pitch  rate  control  can 
be  achieved  whereas  the  normal  acceleration  autopilot  adapts  slowly  and 
its  stability  characteristics  are  sensitive  to  flight  conditions.  The  latter 
behavior  is  caused  by  the  fact  that  the  airframe  input-output  transfer 
function  has  a  right -half-plane  zero  whose  location  is  a  function  of  flight 
condition.  When  a  partial  plant  identification  capability  is  feasible,  it  is 
demonstrated  in  Section  8.2.4  that  an  adaptive  reference  model  used  in 
conjunction  with  a  pitch  rate  autopilot  can  yield  good  adaptive  normal 
acceleration  response. 

A  specific  disadvantage  of  gradient  type  methods  is  that  their 
global  stability  properties  are  generally  unknown  find  situations  where  the 
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system  becomes  unstable  can  occur.  A  large  amount  of  simulation  with 
some  trial  and  error  design  may  be  required  to  arrive  at  a  satisfactory 
system  configuration 

Liapunov  Design  Method  —  The  Liapunov  design  technique  is 
applied  to  a  missile  pitch  rate  autopilot  in  Section  8.3;  as  already  noted  in 
Section  4.4.4,  this  method  is  suited  only  for  controlling  minimum 
phase  plants.  The  controller  configuration  is  characterized  by  a  time- 
invariant  reference  model  and  no  plant  identification  capability.  Rapid 
adaptation  characteristics  are  observed  in  all  simulations  performed. 
However,  it  should  be  emphasized  that  the  theory  ensures  the  desired 
global  stability  properties  only  when  plant  parameters  are  constant.  During 
periods  of  rapidly  changing  flight  conditions,  there  is  no  guarantee  that  the 
system  will  remain  stable.  Just  as  with  gradient  methods,  considerable 
simulation  is  required  to  definitively  establish  its  usefulness  for  a  par¬ 
ticular  set  of  circumstances.  The  results  presented  here  indicate  that 
Liapunov  synthesis  procedures  are  feasible  and  promising  for  controlling 
the  output  of  a  minimum  phase  airframe  transfer  function,  i.  e. ,  one  which 
has  no  right -half -plane  zeros. 

In  order  to  circumvent  the  restriction  to  minimum  phase 
plants  when  an  adaptive  normal  acceleration  autopilot  is  needed  for  tail- 
controlled  missile,  the  adaptive  reference  model  concept  is  suggested  in 
Section  8.3.4.  The  latter  is  implemented  in  the  same  fashion  as  described 
in  Section  8.2.4  for  the  accelerated  gradient  method. 


8.4.2  Conclusions 

The  simulation  results  presented  in  this  chapter  indicate  that 
both  the  accelerated  gradient  and  Liapunov  design  methods  are  feasible  for 
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adaptive  control  of  a  missile  airframe,  subject  to  wide  parameter  varia¬ 
tions  whose  input-output  transfer  function  is  minimum  phase.  The 
Liapunov  design  technique  achieves  somewhat  better  adaptation  charac¬ 
teristics  because  of  the  inherent  stability  properties  of  the  resulting  adap¬ 
tive  controller. 

Both  of  the  above  techniques,  as  developed  in  Chapter  4,  are  not 
well-suited  for  controlling  normal  acceleration  in  the  tail -controlled  mis¬ 
sile  illustrated  in  Fig.  8.1-1  because  of  the  associated  nonminimum  phase 
transfer  function.  We  have  pointed  out  that  one  way  of  overcoming  this 
problem  is  to  utilize  an  adaptive  reference  model  with  a  pitch  rate  autopilot; 
the  net  effect  is  to  control  both  pitch  rate  and  normal  acceleration.  Another 
possible  solution  is  a  change  in  the  airframe  design  that  will  eliminate  the 
nonminimum  phase  characteristic.  Some  possible  methods  of  accomplish¬ 
ing  the  latter  are  outlined  below. 

The  right -half-plane  zero  in  the  transfer  function  between  control 
surface  deflection  and  noimal  acceleration  for  the  missile  in  Fig.  8.1-1 
occurs  because  the  control  surfaces  are  mounted  on  the  tail,  behind  the  cen¬ 
ter  of  gravity.  To  produce  an  acceleration  in  one  direction  the  control  sur¬ 
face  must  deflect  at  an  angle  which  initially  produces  an  acceleration --i.e., 
a  force  on  the  control  surface  --in  the  opposite  direction.*  The  right-half¬ 
plane  zero  can  be  removed  by  putting  the  control  surface  forward  of  the 
missile’s  center  of  gravity  in  a  canard  configu ration  (Refs.  129,  130). 
Although  this  airframe  structure  sometimes  has  certain  aerodynamic  dis¬ 
advantages  --  e.  g. ,  the  drag  may  be  excessive  and  the  flow  of  air  over  the 
primary  lifting  surfaces  (wings)  can  be  distorted  by  the  wake  of  the  canard 


* 

See  the  curves  in  Fig.  8.2-14  where  the  acceleration  just  after  t  =  0 
is  in  the  direction  opposite  to  that  commanded. 
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structure  —  it  is  worthwhile  considering  for  alleviating  the  problems 
encountered  in  normal  acceleration  control. 

Another  possible  airframe  configuration  is  a  form  of  direct  lift 
control  (Refs.  129,  131,  132)  provided  by  a  rotatable  wing  near  the  center 
of  gravity,  in  conjunction  with  tail  controls.  Many  of  the  problems  asso¬ 
ciated  with  missile  autopilot  design  are  a  result  of  the  fact  that  the  air¬ 
frame  must  pitch  to  increase  angle  of  attack  to  produce  aerodynamic  lift. 
The  ability  bf  the  control  surfaces  to  generate  pitching  moments  varies 
with  flight  condition,  also  causing  the  airframe  response  characteristics 
to  vary.  This  difficulty  is  largely  eliminated  if  the  airframe  wings  can  be 
rotated  with  respect  to  the  fuselage  in  the  same  fashion  as  the  tail  control 
surface.  This  provides  the  capability  for  rapidly  changing  the  angle  of 
attack  (with  respect  to  the  wings)  so  that  the  missile  can  quickly  develop  a 
lifting  force  at  any  flight  condition.  In  this  configuration  the  tail  control 
surface  may  still  be  needed  to  maintain  airframe  stability. 

Finally  we  note  that  the  need  for  adaptive  control  in  missile 
applications  arises  largely  through  variations  in  airframe  aerodynamic 
properties .  Therefore  the  need  for  an  adaptive  autopilot  may  be  reduced  if 
a  method  of  control  —  such  as  thrust  vectoring*  —  is  employed  which  does 
not.  depend  on  aerodynamic  forces  for  developing  control  moments  and 
maneuvering  accelerations. 

This  chapter  has  treated  a  variety  of  methods  associated  with 
implicit  plant  identification  for  adaptively  controlling  a  missile  airframe.  In 
the  next  chapter  techniques  are  considered  which  depend  upon  explicit 
plant  identification. 

Thrust  vectoring  refers  to  a  concept  where  a  variable  component  of  the 
main  engine  thrust  vector  normal  to  the  missile’s  longitudinal  axis  is  used 
to  control  airframe  pitch  motion,,  and  the  primary  maneuvering  force  is 
supplied  by  the  longitudinal  component  of  thrust  combined  with  a  large 
angle  of  attack. 
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9.  APPLICATIONS:  PARAMETER  ADAPTIVE  CONTROL 

SYSTEMS  WITH  EXPLICIT  PLANT  IDENTIFICATION 


In  Chapter  5  a  number  of  adaptive  techniques  depending  upon 
explicit  identification  of  plant  parameters  are  reviewed.  Three  of  these  -- 
using  pole  assignment,  optimal  regulator,  and  optimal  model  following  con¬ 
cepts  --  are  examined  here  for  possible  application  to  tactical  missiles.  In 
applying  each  of  these  methods,  the  actuator  and  airframe  are  considered 
to  comprise  a  third  order  linear  system  with  dynamics  as  specified  in 
Eqs.  (8.1-3)  and  (8.1-4).  A  functional  block  diagram  of  the  adaptive  con¬ 
troller  is  shown  in  Fig.  9.1-1.  The  airframe  equations  of  motion  are  known 
to  within  a  set  of  varying  parameters,  a,  that  can  be  accurately  estimated 
by  one  of  the  methods  discussed  in  Chapter  6.  All  state  variables  —  control 
surface  deflection,  normal  acceleration,  and  pitch  rate  —  are  considered 
available  for  use  in  the  controller  and  for  providing  parameter  estimates,  a. 

In  all  the  applications  examined  here  the  controller  structure  is 
linear,  with  adaptive  gains  whose  values  depend  upon  the  parameter  esti¬ 
mates,  as  indicated  in  Fig.  9.1-2.  The  equations  of  motion  have  the  form 

x(t)  =  A(a)  x(t)  +  b(a)  (-h(a)T  x(t)  +  k(a)  v(t)) 

y(t)  =  a(t)  =  cTx(t);  cT  =  [0  1  0]  (9.1-1) 

There  is  one  gain  associated  with  each  state  variable  and  a  fourth  is  applied 
at  the  input  to  give  the  proper  steady  3tate  (d-c)  gain  between  v(t)  and  the 
airframe  output.  In  this  chapter  we  shall  think  of  the  output  as  being  nor¬ 
mal  acceleration,  consistent  with  the  steering  commands  expected  from  the 
guidance  loop.  The  only  functional  difference  between  each  type  of  adaptive 


9-1 


THE  ANALYTIC  SCIENCES  CORPORATION 


THE  ANALYTIC  SCIENCES  CORPORATION 


9. 1  POLE  ASSIGNMENT 


An  adaptive  procedure  for  selecting  feedback  gains  to  position  the 
closed  loop  poles  for  a  controllable  linear  system  is  described  in  Section 
5.3.  The  gains  are  obtained  as  functions  of  the  parameter  estimates  a 
by  solving  Eq.  (5.3-7),  rewritten  here  in  the  form 


Det 


Is  -  A(a)  +  b(a)h^  = 


3 

=  =  +  a  s 
m 


b  s 
m 


+  c 


m 


(9.1-2) 


For  the  missile  application  A  and  b  are  defined  in  Eq.  (8. 1-4)  and  are 
regarded  here  as  functions  of  the  estimated  parameters.  The  quantities  — 
am’  ^m  cm  ““  are  coe^icients  °f  the  expanded  polynomial  on  the 
right  hand  side  of  Eq.  (5.3-7);  i.e.. 


3  2  . 

s  +a  s  +b  s+c 
m  m  m 


b  =  pw  v  +  pm  pw  +Pm  pm 
m  m1  mg  m^  m3  m2  ^^3 


c  =  -pM  pw 
m  m^  mg  m^ 


(9.1-3) 


where  the  quantities,  p^,  are  the  desired  closed  loop  poles.  Equation 
(9.1-2)  is  solved  by  expanding  the  determinant  and  equating  like  powers 
of  s.  The  result  is  a  linear  set  of  equations  having  a  solution  given  by 

h(a)  =  P(af 1  d(a)  (9.1-4) 


where 
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A  A 
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(9.1-5) 


The  carat  superscript  denotes  those  elements  of  A  and  b  which  must  be 
estimated;  a^  and  bg  describe  the  actuator  dynamics  which  are  assumed 
to  be  known.  The  gains  h  can  be  updated  from  Eqs.  (9. 1-4)  and  (9. 1-5) 
whenever  new  parameter  estimates  are  generated. 


The  d-c  gain  k(a)  is  letermined  so  that  the  acceleration  output 
equals  the  input  in  the  steady  state  when  v(t)  is  constant.  In  other  words, 
setting  x(t)  equal  to  zero  in  Eq.  (9.1-1)  and  solving  for  a(t),  we  require 

a(t)  =  -cT(A-bhT)  bkv  =  v  (9.1-6) 

Thus  the  solution  for  k  becomes 


k(a)  *  -- - - - q -  (9.1-7) 

cT  (A(a) -b(a)  h(a)T)  b(a) 

Equations  (9. 1-4)  and (9. 1-7) together  completely  specify  the  adaptive  con¬ 
troller. 


Application  —  Using  the  air-to-surface  trajectory  data  given  in 
Section  H.  2  of  Appendix  H,  an  autopilot  control  system  was  simulated.  The 


9-4 


THE  ANALYTIC  SCIENCES  CORPORATION 


airframe  dynamics  along  the  trajectory  were  made  time-varying  by  the  linear 
interpolation  procedure  describe'’  Section  H.l.  The  values  of  the  assigned 
poles,  pmi  in  Eq.  (9.1-3),  are 

-60.0 

-23.0  ±j  17.6  (9.1-8) 


These  particular  choices  provide  a  comparison  with  the  adaptive  optimal 
controllers  discussed  later  in  this  chapter. 


If  the  adaptive  gains  are  continuously  computed  from  Eqs.(9.1-3) 
and  (9.1-6),  the  instantaneous  system  poles  will  always  be  at  the  values 
specified  by  Eq.  (9.1-7).  However,  practically  speaking  it  takes  time  to 
obtain  parameter  estimates  and  to  carry  out  the  calculations  needed  to 
determine  h  and  k.  In  addition,  the  computation  equipment  must  generally 
perform  other  tasks  related  to  guidance  and  navigation.  Consequently  it  is 
possible  to  up-date  the  adaptive  gains  only  at  discrete  times  separated  by 
intervals  of  some  specified  length.  During  each  interval  the  plant  param¬ 
eters  continue  to  vary  and  the  closed  loop  system  poles  depart  from  the 
desired  values.  Therefore  one  measure  of  the  performance  of  the  system 
and  an  indication  of  how  often  the  gains  should  be  recomputed  is  provided 
by  a  graph  showing  variations  in  pole  location  as  a  function  of  time.  For¬ 
tunately  not  all  of  the  poles  need  be  examined,  only  those  nearest  the 
imaginary  axis  contribute  significantly  to  the  system  response.  It  is  the 
real  part  of  the  latter  which  is  of  most  interest  because  it  is  closely  related 
to  the  system  settling  time. 


The  variation  of  the  real  part,  or,  of  the  dominant  complex  poles 
of  the  missile  autopilot  along  the  trajectory  is  shown  in  Fig.  9.1-3.  The 
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Figure  9.1-3  Value  of  the  Real  Part  of  the  Most 
Significant  Closed  Loop  Pole: 

Pole  Assignment  Technique 


adaptive  gains  k  and  h  were  recomputed  every  second,  resulting  in  dis¬ 
continuous  changes  in  a  as  indicated  by  the  dashed  lines  in  the  figure. 

Notice  the  especially  rapid  changes  near  the  beginning  of  the  trajectory. 
This  behavior  is  partially  caused  by  the  fact  that  the  missile  airframe 
parameters  vary  most  rapidly  during  the  boost  phase  of  the  trajectory; 
also  the  closed  loop  poles  are  most  sensitive  to  changes  in  plant  param¬ 
eters  vhen  the  dynamic  pressure  is  low.  The  latter  effect  is  caused  by  the 
fact  ina;:  the  vector  b  in  Eq.  (9. 1-2)  varies  with  flight  condition.  Conse¬ 
quently  during  those  periods  of  time  when  the  feedback  gains  h  are  constant, 
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the  sensitivity  of  the  closed  loop  poles  to  changes  in  b  is  partially  deter¬ 
mined  by  h.  When  dynamic  pressure  is  low  the  magnitude  of  b  tends  to  be 
small,  requiring  large  feedback  gains  to  achieve  specified  closed  loop  poles; 
hence  the  sensitivity  to  parameter  variations  tends  to  be  greatest  at  low 
dynamic  pressure.  This  is  one  illustration  of  a  case  where  feedback  can 
increase  system  sensitivity  to  changes  in  plant  dynamics  (see  the  related 
discussion  in  Section  7.1).  From  the  trend  indicated  during  the  first 
second  of  flight  it  is  seen  that  the  autopilot  would  become  temporarily  un¬ 
stable  if  the  gains  were  updated  less  frequently.  The  above  observations 
indicate  that  the  pole  assignment  technique  should  yield  a  satisfactory 
response  time  over  most  of  the  trajectory  provided  plant  identification  is 
rapidly  accomplished.  A  more  definitive  judgement  about  the  system’s 
performance  can  be  made  from  viewing  sample  autopilot  response  curves. 
These  are  provided  in  Section  9.4  and  are  compared  with  the  other  adap¬ 
tive  methods  being  evaluated. 

The  implementation  of  the  adaptive  pole  assignment  controller 
is  relatively  easy  to  accomplish  and  is  therefore  quite  competitive  with 
other  techniques  which  require  plant  identification.  The  mechanization  of 
Eqs.  (9.1-4)  and  (9.1-5)  with  a  digital  computer  provides  the  capability  to 
rapidly  update  feedback  gains. 

In  this  application  only  one  set  of  closed  loop  poles  is  assigned 
in  solving  for  the  feedback  gains  from  Eq.  (9. 1-4).  However  different  sets 
of  closed  loop  poles  —  £m^,  i  =  1, . .  ,M  —  could  be  specified  for  different 
ranges  of  flight  conditions.  For  example,  if  the  autopilot  closed  loop  poles 
are  required  to  have  smaller  magnitudes  near  the  beginning  of  the  trajec¬ 
tory  than  those  specified  in  Eq.  (9.1-8),  then  less  frequent  gain  calculations 
are  needed.  This  effect  is  also  a  result  of  the  fact  that  the  sensitivity  of  the 
closed  loop  poles  to  changes  in  parameters  is  partially  determined  by  the 
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gains  h.  The  larger  are  the  magnitudes  of  the  elements  of  h,  the  greater 
is  the  effect  of  parameter  variations.  Reducing  the  magnitude  of  the 
desired  closed  loop  poles  tends  to  reduce  the  level  of  the  gains  and  thus 
reduce  sensitivity.  With  this  modification  to  the  pole  assignment  method, 
Eq.  (9. 1-4)  symbolically  becomes 


(9.1-9) 


where  both  a  and  gmi  change  adaptively,  the  former  as  new  estimates  are 
generated  and  the  latter  as  flight  conditions  change.  The  dependence  of 
the  gains  on  the  assigned  poles  is  explicitly  provided  by  Eq.  (9. 1-3). 


The  above  proposal  is  another  type  of  adaptive  reference  model 
(see  Section  8.2-4)  where  the  model  specifications  (i.e.,  assigned  poles) 
are  adjusted  with  changes  in  flight  condition  so  that  acceptable  system 
behavior  is  obtained.  It  is  consistent  with  the  idea  that  the  designer  should 
not  expect  the  control  system  to  produce  fixed  response  characteristics 
that  are  unreasonable  for  some  operating  conditions. 


9.2  THE  ADAPTIVE  OPTIMAL  REGULATOR 

hi  contrast  with  the  pole  assignment  method  discussed  previously, 
optimal  control  techniques  achieve  a  compromise  between  the  objectives  of 
obtaining  good  response  characteristics  and  maintaining  acceptable  con¬ 
trol  levels.  In  this  section  the  adaptive  optimal  regulator  described  in 
Section  5.4.1  and  Appendix  B  is  implemented  using  trajectory  data  provided 
in  Section  H.2.  The  term  "adaptive"  denotes  the  fact  that  a  method  is  pro¬ 
vided  for  recomputing  optimal  feedback  gains  on-line  as  new  estimates  of 
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plant  parameters  are  obtained.  Both  optimal  and  suboptimal  designs  are 
considered. 


9.2.1  Design  of  a  Third  Order  Adaptive  Optimal  Regulator 

In  this  section  we  apply  optimal  control  techniques  to  design  an 
adaptive  autopilot  fcr  a  tactical  missile  having  a  first  order  actuator  and 
a  second  order  airframe  with  dynamics  specified  in  Eqs.  (8. 1-3)  and 
(8.1-4).  The  problem  as  outlined  in  Section  5.4. 1  is  to  assume  v(t)  is 
zero  in  Eq.  (9.1-1)  and  determine  u(t)  in  the  equations  of  motion, 

x(t)  =  Ax(t)+bu(t);  x(t0)  =  x^  (9.2-1) 

such  that  the  performance  index 


J 


Qx(r)  +  ru(t)* 


dt 


(9.2-2) 


is  minimized.  This  problem  formulation  leads  to  solutions  for  the  gains 
h  and  k  in  Eq.  (9.1-1)  which  are  given  in  Eqs.  (5.4-4),  (5.4-5)  and  (5.4-8) 

in  terms  of  the  parameter  estimates.  These  expressions  are  repeated  here 
with  a  substituted  for  a  : 

~  — o 


h(a)T  =  ~  b(a)T 

(9.2-3) 

-SA(a)  -  A(a)TS  +  £  Sb(a)  b(a)T  S  -  Q  =  0. 

(9.2-4) 

k(a)  = - - - - 

(9.2-5) 

cT  ^A(a)  -b(a)  h(a)Tj  b(a) 
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Recall  that  each  calculation  for  the  gains  h(a)  in  Eq.  (9.2-3)  requires  the 
matrix  S(a),  which  is  the  solution  to  the  nonlinear  steady  state  Riccati 
eauation  in  Eq.  (9.2-4).  This  generally  must  be  accomplished  by  an  itera¬ 
tive  procedure.  One  efficient  method  that  is  suitable  for  low  order  systems 
is  the  Newton -Raphson  technique  discussed  in  Appendix  F.  Applied  to  this 
problem,  an  approximate  solution  for  S(a)  is  obtained  by  iteratively  solving 
the  linear  matrix  equations 

(A-ibbTsJTSk  +  1  +  Sk  +  1(A-ibbTSk)+iskbbTSk+Q  -  0 


(9.2-6) 

In  order  that  the  sequence  —  S^,  S^, . . .  —  converges  to  the  proper  value,  the 
starting  value  SQ  is  chosen  such  that  the  eigenvalues  of  the  matrix 

(A-ibbTS0)  (9.2-7) 

all  have  hegative  real  parts.  Enough  iterations  are  performed  so  that 
acceptable  accuracy  in  S  is  achieved;  typically  five  to  ten  are  required  for 
the  simulations  described  here. 

The  problem  of  determining  how  often  to  calculate  h(a)  is  more 
important  here  than  in  the  pole  assignment  method  because  of  the  increased 

A 

computational  load  imposed  by  the  need  to  determine  S(a).  One  method  for 
deciding  this  ques^on  is  to  evaluate  the  performance  index  J  for  the  repre¬ 
sentative  trajectory  as  a  function  of  the  number,  N,  of  gain  lecalculations. 
Presumably,  if  h(a)  is  updated  too  infrequently,  it  will  differ  widely  from 
^amurn  value  along  significant  portions  of  the  trajectory  because  of 
airframe  parameter  variations.  This  fact  should  be  reflected  in  an  increase 
in  J  as  N  decreases.  Hopefully  a  computation  interval  can  be  found  that  is 
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short  enough  to  yield  an  acceptably  small  value  of  J  and  yet  is  long  enough 
with  respect  to  the  system  response  time  so  the  infinite  upper  limit  in  the 
integral  of  Eq.  (9.2-2)  remains  valid.  If  this  cannot  be  done,  it  implies 
that  system  parameters  are  varying  too  rapidly  for  adaptive  steady  state 
regulator  gains  to  yield  an  adequate  controller  design. 

The  performance  evaluation  described  above  was  carried  out 
over  the  same  50  second  trajectory  used  in  Section  9.1  by  calculating 


where  the  times  t.  are  equally  spaced  points  at  which  h  is  recalculated 
according  to  Eq.  (9.2-3),  using  parameter  estimates  a ..  The  parameters 
were  assumed  to  be  perfectly  estimated  in  this  simulation.  The  elements 
of  A  and  b  in  Eq.  (9.2-1)  were  taken  from  Table  H.6  with  linear  interpola¬ 
tion  between  the  given  flight  conditions.  At  times  t. ,  the  following  opera¬ 
tions  were  performed:  A  and  b  were  updated  in  Eqs.  (9.2-6)  and  (9.2-7), 
an  approximate  solution  was  obtained  for  S(sL)  by  Newton- Raphson  iteration, 
and  the  adaptive  gains  were  recomputed.  Two  different  values  of  the 
weighting  matrix  Q  were  chosen, 


l.OxlO"5 

I 

o 

o 

1.0  x  10”2  0  0 

V 

0 

0 

2.0xl0“8  0 

0  1 

;  Q2  ~ 

0  2.0xl0“8  0 

0  0  1 

-  m 

(9.2-8) 

with  a  fixed  value  of  r,  r  =  1.0.  The  equations  of  motion  were  ’’driven"  by 
taking  the  input  v(t)  as  zero  and  setting  the  state  of  the  autopilot  to  a  speci¬ 
fied  value,  dt  x  at  one  second  intervals,  with  the  sign  alternating  at 
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successive  intervals.  This  simulates  a  situation  where  the  system  has 
been  allowed  to  reach  steady  state  in  response  to  a  constant  input 
command  and  then  v(t)  is  suddently  changed  to  zero. 

The  variation  in  J  as  a  function  of  N  is  plotted  in  Fig.  8. 2-1  for 
both  weighting  matrices  and  Q^.  Only  integer  values  of  N  have  meaning 

but  points  are  connected  by  straight  lines  to  indicate  trends.  The  value 
N  =  0  corresponds  to  no  feedback  compensation  at  all,  i.e. ,  h  =  0.  The 
points  where  J  has  the  value  "o d'  occur  when  the  system  becomes  unstable, 
resulting  in  an  essentially  '’infinite”  cost.  One  conclusion  is  that  too  few 
recalculations,  which  can  result  in  large  departures  of  the  gains  from  their 
optimum  values  as  parameters  vary,  are  worse  than  no  feedback  (h  =  0). 
This  sensitivity  problem  is  the  same  type  encountered  in  the  pole  assign¬ 
ment  technique  in  the  preceding  section.  With  weighting  Q2  the  system 
becomes  unstable  when  N  is  too  small;  the  same  behavior  is  observed  with 
the  pole  assignment  technique  if  the  feedback  gains  are  recalculated  too 
infrequently.  However,  it  is  clear  that  the  minimum  of  J  is  achieved  for 
all  practical  purposes  when  N  ^  10  for  both  choices  of  Q;  this  is  to  be  con¬ 
trasted  with  the  50  recalculations  required  to  maintain  stability  in  the  case 
illustrated  by  Fig.  9.1-3. 

To  compare  the  pole  assignment  method  and  the  optimal  regulator 
on  the  same  basis,  graphs  of  the  real  part  of  the  closed  loop  airframe  poles 
are  shown  in  Fig.  9.2-2  for  the  optimal  regulator  design  with  Q  =  Q2  and 
N  =  10.  This  value  of  the  weighting  matrix  is  used  throughout  our  subse¬ 
quent  discussion.  The  discontinuities  in  the  curve  at  5  second  intervals  are 
the  points  at  which  a  new  set  of  optimal  gains  is  computed.  In  the  same 
figure  the  curve  corresponding  to  the  uncompensated  airframe  (N  =  0)  is  also 
shown.  The  horizontal  dotted  line  at  the  value  -23  represents  the  design 
value  for  the  adaptive  pole  assignment  system. 
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K-H04 


Figure  9.2-1  Performance  Index  as  a  Function  of  the 
Number  of  Optimal  Gain  Recalculations: 

Adaptive  Optimal  Regulator 

The  weighting  matrix  for  the  performance  index  is  chosen  so  that 
the  optimal  regulator  feedback  gains  make  the  closed  loop  system  poles  at 
25  seconds  equal  to  those  specified  for  the  pole  assignment  method  in 
Eq.  (9. 1-8).  In  other  words  the  missile  autopilot  using  the  optimal  regu¬ 
lator  gains  has  the  same  dynamics  as  the  adaptive  pole  assignment  system 
at  one  instant  of  time  along  the  missile's  trajectory.  This  provides  a  basis 
of  comparison  for  the  performance  of  the  two  systems.  We  can  suppose 
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Figure  9.2-2  Value  o f  the  Real  Part  of  the  Most 
Significant  Closed  Loop  Pole: 

Optimal  Regulator  Method 

that  ideally  the  instantaneous  system  closed  loop  poles  should  always  have 
the  values  specified  in  Eq.  (9. 1-8).  This  can  be  assured  using  adaptive 
pole  assignment  by  recomputing  the  gains  sufficiently  often.  On  the  other 
hand  the  optimal  regulator  provides  the  desired  response  characteristics 
at  only  one  instant.  At  other  points  on  the  trajectory,  Fig.  9.2-2  indicates 
a  shorter  or  longer  settling  time  (larger  or  smaller  a)  depending  upon  the 
compromise  between  response  speed  and  control  level  dictated  by  the 
minimization  of  the  performance  index  in  Eq.  (9. 2-2).  This  is  exactly  the 
sort  of  comparison  one  expects  because  the  pole  assignment  technique 
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places  no  penalty  on  the  control  magnitude  required  to  give  the  desired  out¬ 
put  performance.  These  comparisons  are  also  supported  by  the  output 
response  curves  presented  in  Section  9.4. 

The  main  objection  to  using  the  optimal  regulator  design  is  that 
one  has  no  direct  control  over  how  much  the  closed  loop  response  charac¬ 
teristics  vary  as  plant  parameters  change.  The  designer  may  be  willing  to 
accept  some  deviation  from  criteria  such  as  those  specified  in  Eq.  (9. 1-8), 
but  variations  of  the  kind  indicated  in  Fig.  9.2-2  may  be  too  large  — 
especially  near  the  beginning  of  the  trajectory.  More  detailed  discussion 
of  this  point  is  given  in  Section  5.4. 1  a>id  Appendix  B.  Possible  alterna¬ 
tive  methods  are  provided  by  the  optimal  model  following  systems  described 
in  Sections  5.4.2  and  5.4.3  which  have  the  capability  for  obtaining  more 
uniform  output  performance  characteristics  over  a  range  of  flight  conditions. 
One  of  these  is  examined  in  Section  9.3. 


9.2.2  Design  of  a  Third  Order  Adaptive  Suboptimal  Regulator 

Many  linear  optimal  control  problems  suffer  from  the  "curse  of 
dimensionality"  in  that  a  large  number  of  variables  are  required  to  des¬ 
cribe  the  plant  dynamics  and  considerable  computation  is  required  to  obtain 
optimum  feedback  gains.  This  is  a  very  important  consideration  in  adap¬ 
tive  systems  when  the  feedback  gains  must  be  updated  on-line  by  solving  the 
matrix  Riccati  equation.  To  accomplish  this  task  using  the  Newton -Raphson 
iteration  (Eq.  (9.2-6)),  the  computation  time  required  is  proportional  to  n® 
where  n  is  the  dimension  of  the  state.  Hence  even  a  small  reduction  in  the 
number  of  state  variables  can  substantially  reduce  the  computational  load. 
Consequently  it  is  desirable  to  consider  suboptimal  contro_  laws  which 
neglect  some  of  the  state  variables.  This  is  often  justified  in  linear  systems 
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when  certain  open  loop  poles  do  not  contribute  significantly  to  the  open  loop 
response.  Such  is  the  case  in  missiles  if  dynamics  introduced  by  the 
flexible  airframe,  actuator  and  sensors  are  negligible  compared  with  those 
of  the  airframe’s  rigid  body  motion.  In  this  section  we  evaluate  the  sys¬ 
tem  design  obtained  by  solving  the  optimal  control  problem  with  the  actuator 
dynamics  in  Eq.  (8. 1-3)  neglected,  and  then  implementing  the  resulting 
feedback  gains  with  the  actuator  included. 

The  equations  of  motion  for  the  second  order  system  correspond¬ 
ing  to  Eq.  (8. 1-3)  with  6{t)  regarded  as  the  control  variable  are 


w 

M 

q 

M 

a 

q(t)  ‘ 

-Av 

'Ms' 

m 

l 

-L 

a 

oit) 

;L« 

•6(t) 


(9.2-9) 


where  a(t)  is  the  angle  to  attack.  Comparing  the  third  order  and  second 
order  systems,  make  the  identifications 


u(t)  =  6(t) 

att)  =  y£-  (a(t)-  VL.  6(t>) 

/V  \  ' 


(9.2-10) 


Denoting  the  diagonal  elements  of  Q  in  Eq.  (9. 2-2)  by  q^,  q22  and  qg3  with 
off-diagonal  terms  equal  to  zero  and  substituting  for  a(t)  from  Eq.  (9.2-10), 
the  performance  index  J  in  Eq.  (9.2-2)  becomes 

00 

J  =  5  [qnq(t)2  +  q22V2 (Lfto<t)  +  Lfiu(t))2  +  (q33  +r)  u(t)2  j  dt 


(9.2-11) 
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If  the  optimal  feedback  gains  for  minimizing  J  in  Eq.  (9.2-11)  for  the 
second  order  system  are  h'  and  hg,  the  corresponding  values  of  the  gains 
for  the  third  order  system  illustrated  in  Fig.  9.1-2  are  obtained  by  setting 

h^q(t)  +  hgO(t)  =  h^t)  +  h2a(t)  +  h36(t) 

Substitution  for  a(t)  from  Eq.  (9.2-10)  into  the  above  expression  and  equat¬ 
ing  coefficients  of  like  state  variables  on  each  side  of  the  equation  produces 


h,  = 


hl 


h„  = 


h2L5 


a 


h2 


h2 

VL 


a 


(9.2-12) 


Plots  of  the  real  part  of  the  dominant  pole  for  both  the  suboptimal 
and  optimal  adaptive  systems  are  shown  in  Fig.  9.2-3  for  two  different 
values  of  the  actuator  pole,  -X.  In  both  eases  the  feedback  gains  are  recal¬ 
culated  at  one  second  intervals  (N  =  50)  and  with  Q  =  Q„.  Observe  that  for 

-3  ^ 

X  as  large  as  200  (actuator  time  constant  =  5  x  10  sec. )  there  is  appre¬ 
ciable  deviation  in  the  optimal  and  suboptimal  curves  during  the  later  por¬ 
tion  of  the  trajectory,  as  the  system  closed  loop  poles  move  further  into 
the  left-half -plane.  The  conclusion  is  that  in  order  for  the  second  order 
design  to  be  a  good  approximation  to  the  third  order  design,  the  actuator 
poles  must  not  only  be  far  removed  from  those  of  the  uncompensated  mis¬ 
sile  airframe,  but  they  should  also  be  some  distance  from  those  of  the 
compensated  airframe.  This  condition  does  not  hold  for  all  flight  con¬ 
ditions  in  the  system  we  are  considering  (X  =  50);  consequently  actuator 
dynamics  are  not  neglected  in  subsequent  sections. 


9-17 


THE  ANALYTIC  SCIENCES  CORPORATION 


/  -no*  1-2)02 


Figure  9.2-3  Comparison  of  the  Vaiues  of  the  Real  Part  of  the 
Most  Significant  Closed  Loop  Pole  for  the  Optimal 
and  Suboptimal  Regulator  Designs 


9. 3  ADAPTIVE  OPTIMAL  MODEL  FOLLOWING  SYSTEMS 

In  this  section  we  apply  the  "model-in-the-performance-index" 
model  following  control  technique  described  in  Section  5.4. 2  to  a  missile 
autopilot.  This  method  permits  adaptive  regulation  of  control  system 
response  characteristics  along  a  trajectory  by  defining  a  performance  cri¬ 
terion  in  terms  of  the  error  between  the  actual  (measured)  and  the  desired 
airframe  djnr  mic  properties.  The  error  signal  is  given  by 
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§T(t)  =  (A(a)  -  Am)  x(t)  +  b(a)  u(t),  (9.3-1) 


as  in  Eq.  (5.4-13),  where  the  matrix  Am  represents  the  dynamics  of  a 
reference  model, 


4n(t)  *  Am^m(t) 


The  objective  is  to  determine  u(t)  so  that 


J  Fe(t)T  Qe(t)  +  ru(t)2 1  dt  (9.3-2) 

‘o 


is  minimized.  Again  the  notation  A(a)  and  b(a)  is  used  to  emphasize  the 
dependence  of  the  control  upon  parameter  estimates.  This  problem  formu¬ 
lation  leads  to  solutions  for  the  gains  h  and  k  in  Eq.  (9.1-1)  which  are  given 
in  Eqs.  (5.4-14),  (5.4-15)  and  (5.4-8).  These  expressions  are  repeated 

here  with  a  substituted  for  a  : 

—  — o 


.  T  b(a)T 

h(a)T  -  — — 

r(a) 

r(a)  -  b(a)T  Qb(a)  +  r 
H(a)  =  A(a)  -  Am 


k(a)  = - - - ^ -  (9.3-3) 

cT  (A(a)  -b(a)  h(a)T)  b(a) 
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where  S(a)  satisfies 

-S  A(a)  -  A(a)T  S+  —  Sb(a)  b(a)T  S-  H(a)TQ(a)H(a)  =  0 
r(a) 

A(a)  =  A(a)  — V  b(a)  b(a)T  QH(a) 
r(a) 

Q(a)  =  Q  —  Qb(a)  b(a)T  Q  (9.3-4) 

r(a) 

Equation  (9.3-4)  is  solved  iteratively  for  S(a)  using  the  same  Newton- 
Raphson  method  described  in  the  preceding  section.  The  response  of  the 
airframe  tends  to  be  similar  to  that  of  the  model  to  the  extent  that  the  dif¬ 
ference,  AA,  between  the  dynamics  of  the  reference  model  and  the  closed 
loop  control  system,  as  expressed  by 

AA  =  (A(a)  -  b(a)  h(a)T  -  Am)  ,  (9. 3-5) 

is  made  small  by  the  resulting  value  of  h. 

Simulations  of  this  technique  were  performed  with  Am  given  by 

Am=(A-5!lm)  (9-3-6> 

where  A  and  b  have  the  values  for  the  trajectory  used  in  Sections  9.1  and 
9. 2  at  time  t  =  25  seconds  and  h  is  the  solution  to  the  optimal  regulator 
problem  at  the  same  instant  with  Q  =  Q2*  111  this  way  a  consistent  compari¬ 
son  is  made  between  the  model  following  system  and  the  other  control  tech¬ 
niques  evaluated  in  this  chapter.  Numerical  values  for  the  model  param¬ 
eters  are 
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-7.64 
-1.51  xlO4 
4.31 


0.041  -454.0 

3.53  8.47  xlO4 

4.  47  x  lO-2  -95.8 


(9.3-7) 


Appropriate  weighting  constants  for  the  performance  index  are 


q 


*0.3  0  0 

0  2.0  xlO"7  0 

0  0  1.0 


r  =  1.0 


(9.3-8) 


where  q  is  an  adjustable  parameter  to  provide  comparative  simulation 
results. 


The  real  part  of  the  most  significant  pole  along  the  missile  tra¬ 
jectory  is  displayed  in  Fig.  9.3-1  forthree  different  values  ofq  and  with  the 
feedback  gains  calculated  at  one  second  intervals.  The  dashed  horizontal 
line  at  a  =  -23.0  represents  the  specifications  in  Eq.  (9.1-8)  on  which  the 
pole  assignment  technique  was  based.  Observe  that  as  q  increases,  at 
t  =  25  seconds  the  value  of  a  for  the  model  following  system  appears  to 
approach  a  limit  that  coincides  with  both  the  curve  for  the  optimal  regulator 
and  the  horizontal  dashed  line.  This  behavior  is  a  consequence  of  the  fact 
that  as  q  is  increased  (or  alternatively  as  r  is  decreased)  the  optimal  feed¬ 
back  gains  h  also  approach  a  limit,  as  discussed  in  Section  5.4. 2.  When 
t  =  25  seconds  this  limit  is  identical  with  the  feedback  gains  in  the  optimal 
regulator  and  the  pole  assignment  system  because  the  reference  model 
dynamics  defined  by  Eqs.  (9.3-6)  and  (9.3-7)  are  identical  to  those  of  the 
optimal  regulator  at  25  seconds.  That  is  to  say, 

lim  AA 1  =0 

q-.®  -It  =  25 
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t-not 


Figure  9.3-1  Values  of  the  Real  Part  of  the  Most 
Significant  Closed  Loop  Pole: 
Optimal  Model  Following  System 


where  AA  is  defined  in  Eq.  (9. 3-5).  Consequently  the  model  in  the  per¬ 
formance  index  (with  large  q),  the  optimal  regulator,  and  the  pole  assign¬ 
ment  methods  all  yield  the  same  controller  at  one  point  on  the  trajectory. 
This  property  permits  a  meaningful  comparison  of  the  three  techniques  at 
other  points  along  the  trajectory. 

As  indicated  in  Fig.  9.3-1,  for  a  large  value  of  q  the  tendency  of 
this  controA  system  to  conform  to  the  model  (in  terms  of  the  location  of  the 
most  significant  closed  loop  pole)  is  moderately  better  than  the  optimal  regu¬ 
lator  for  t  >  25  sec  and  is  about  the  same  as  the  regulator  for  t  <25  sec. 
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Deviations  from  the  model  are  a  result  of  our  inability  to  choose  h(t)to  make 
AA  identically  zero  everywhere  on  the  trajectory.  Consequently  a  com¬ 
promise  between  control  level  and  response  characteristics  is  exerted  by 
the  optimization  criterion,  just  as  in  the  regulator  method. 

The  primary  advantage  offered  by  the  model  following  scheme  for 
this  application,  relative  to  the  optimal  regulator,  is  a  conceptual  one.  If 
one  deliberately  specifies  a  set  of  model  dynamics  Am  that  the  compensa¬ 
ted  airframe  can  duplicate  at  one  particular  flight  condition,  then  an  optimal 
control  law  which  minimizes  J  in  Eq.  (9.3-2)  closely  approximates  the 
model  at  that  flight  condition  if  the  performance  index  weighting  constant  q 
in  Eq.  (9.3-8)  is  sufficiently  large.  By  contrast,  it  is  more  difficult  to 
determine  what  choices  of  Q  and  r  in  Eq.  (9.2-2)  realize  specified  dynamics 
at  any  flight  condition.  However,  this  distinction  may  be  somewhat  academic 
from  the  designer's  point  of  view  because  the  performance  indices  are  chosen 
"off-line"  where  there  is  sufficient  time  to  evaluate  different  values  of  Q 
and  r. 


The  computational  requirements  for  this  model  following  system 
are  somewhat  greater  than  for  the  optimal  regulator  as  indicated  by  com¬ 
paring  Eqs.  (9.3-3)  and  (9.3-4)  with  Eqs.  (9.2-3)  and  (9.2-4).  This  dis¬ 
advantage  weighed  againfct  the  small  improvement  obtained  in  Fig.  9.3-1 
indicates  that  the  model-in-the-performance-index  technique  investigated 
here  is  relatively  unattractive  for  controlling  longitudinal  airframe  motion. 
For  other  applications  where  there  is  more  than  one  plant  input  variable 
(e.g. ,  lateral  motion  control)  a  better  match  between  system  and  model 
dynamics  might  be  achieved  over  the  entire  trajectory  using  the  model 
following  method  discussed  above. 

It  is  noted  in  Section  5.4.3  that  the  model-in-the-performance- 
index  method  discussed  above  can  be  made  to  yield  more  uniform 
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performance  by  transforming  state  variables  to  phase  variable  canonical 
form.  The  resulting  adaptive  control  law  requires  some  additional  com¬ 
putation  to  derive  the  required  transformation  matrix.  This  possibility  is 
not  pursued  here. 

Another  type  of  model  following  technique  considered  in  Section 
5.4  is  the  "model-in-the-system"  approach  having  the  structure  indicated 
in  Fig.  5.4  -2.  It  is  capable  of  making  the  airframe  and  reference  model 
dynamics  approximately  identical  for  a  v-ide  range  of  flight  conditions,  at 
the  expense  of  requiring  a  high  gain  type  feedback  structure  and  more  com¬ 
plex  computations  (see  Eqs.  (5.4-19)  and  (5.4-20)).  This  method  is  not 
evaluated  here  because  nearly  the  same  performance  characteristics,  with¬ 
out  the  above  stated  disadvantages,  are  achievable  using  the  pole  assign¬ 
ment  method. 


9.4  RESPONSE  COMPARISONS 

From  the  designer's  point  of  view,  the  best  comparison  among 
different  control  methods  is  provided  by  simulation  of  the  time  histories 
of  the  control  variables  and  important  state  variables.  This  permits  one 
to  examine  the  actual  transient  characteristics  of  the  system  in  response 
to  representative  input  commands,  thus  obtaining  empirical  knowledge  of 
such  parameters  as  rise  time,  overshoot,  and  settling  time. 

To  perform  this  type  of  study,  the  operation  of  the  three  adaptive 
controllers  —  pole  assignment,  optimal  regulator,  and  model  following  — 
described  in  the  preceding  sections  was  simulated  along  the  surface-to-air 
trajectory  described  in  Section  H.  2.  Airframe  dynamics  were  varied  with 
time  according  to  the  linear  interpolation  technique  described  in  Section  H.l. 
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The  commanded  acceleration  v(t)  is  a  piecewise  constant  function  which 
changes  level  and  alternates  its  sign  at  one  second  intervals  according  to 

v(n  +  l)  =  -f(n  +  l)  sign  [v(n)] ;  n  =  l,2, ... 
v(0)  =  f(0) 

where  f(t)  is  specified  graphically  in  Fig.  9.4-1.  The  use  of  acceleration 
commands  that  gradually  increase  in  magnitude  is  reasonable  if  the  missile 
is  launched  some  distance  from  a  target.  The  adaptive  gains  are  also  up¬ 
dated  at  one  second  intervals  along  the  trajectory.  Graphs  of  the  corre¬ 
sponding  control  surface  deflection  6(t)  and  normal  acceleration  a(t)  are 
shown  in  Figs.  9.4-2(a)  through  (e)over  selected  one  second  intervals  along 
the  trajectory.  The  magnitude  of  the  control  surface  deflection  is  of 
interest  because  in  practice  it  usually  has  an  upper  bound  that  cannot  be 
exceeded;  furthermore  it  is  an  indication  of  the  control  level  required  at 
the  actuator  input. 


TIME  (sec) 

Figure  9.4-1  Magnitude  Profile  of  Commanded  Acceleration 
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(a)  DYNAMIC  PRESSURE  SS000  Ib./f? 
MACH  NUMBER  =  4.5 
RESPONSES  FOR  AU  AUTOPILOTS 


Autopilot  Response  Histories  Along  a 
Simulated  Missile  Trajectory  for  Three 
Adaptive  Control  Systems  Using  Explicit 
Plant  Identification 


Figure  9.4-2 
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Figure  9.4-2  (Cont. )  Autopilot  Response  Histories  Along  a  Simulated 

Missile  Trajectory  for  Three  Adaptive  Control 
Systems  Using  Explicit  Plant  Identification 
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Figure  9. 4-2  (Cont. )  Autopilot  Response  Histories  Along  a  Simulated 

Missile  Trajectory  for  Three  Adaptive  Control 
Systems  Using  Explicit  Plant  Identification 
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Figure  9.4-2  (Cont.)  Autopilot  Response  Histories  Along  a  Simulated 

Missile  Trajectory  for  Three  Adaptive  Control 
Systems  Using  Explicit  Plant  Identification 
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Figure  9.4-2  (Cont.)  Autopilot  Response  Histories  Along  a  Simulated 

Missile  Trajectory  for  Three  Adaptive  Control 
Systems  Using  Explicit  Plant  Identification 
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Recall  that  all  three  control  methods  are  designed  to  give 
identical  performance  for  the  flight  condition  corresponding  to  t  =  25 
seconds  along  the  trajectory.  Therefore  on  the  interval  25  s  t  s:  26,  the 
sets  of  time  histories  for  a(t)  and  6(t)  are  identical  as  shown  in  Fig. 

9.4-2(a),  having  negligible  overshoot  and  a  settling  time  of  about  0. 15 
seconds.  In  Figs.  9.4-2(b)  through  (e),  a(t)  and  6(t)  are  plotted  at  the 
beginning  of  several  different  one  second  intervals.  At  t  =  0  (Fig.  (b)) 
when  dynamic  pressure  and  mach  number  are  lowest,  the  greatest  dif¬ 
ferences  in  response  are  observed  among  the  three  systems.  The  pole 
assignment  method,  which  maintains  constant  closed  loop  pole  locations 
throughout  the  trajectory,  exhibits  relatively  large  amplitude  transient 
oscillations  immediately  after  the  command  v(t)  is  applied.  This  be¬ 
havior  is  caused  by  the  fact  that  a  relatively  large  change  in  angle  of  attack 
is  required  to  produce  a  prescribed  change  in  normal  acceleration  at  this 
flight  condition.  Consequently  in  response  to  v(t)  the  control  surface  de¬ 
flection  must  be  relatively  large,  compared  with  its  steady  state  value,  to 
produce  the  required  pitching  moment  and  to  compensate  for  the  reduced 
control  surface  effectiveness.  By  contrast  both  optimal  systems  permit  a 
much  slower  speed  of  response  at  this  flight  condition  and  consequently  have 
smaller  transient  fluctuations.  However,  at  the  other  times  shown 
(Figs,  (c)  through  (e))  the  behavior  of  the  pole  assignment  system  is  quite 
uniform  without  calling  for  excessive  control  surface  deflection.  As 
expected,  the  optimal  system  responses  vary  along  the  trajectory,  generally 
becoming  faster  as  dynamic  pressure  and  mach  number  increase.  Also  they 
tend  to  exhibit  some  overshoot  in  acceleration,  particularly  in  Fig.  (c). 

The  conclusion  from  this  demonstration  is  that  the  pole  assign¬ 
ment  method  yields  acceptable  behavior  over  a  wide  range  of  flight  condi¬ 
tions,  excepting  those  where  both  mach  number  and  dynamic  pressure  are 
lowest.  The  latter  condition  typically  occurs  just  after  launch,  before  the 
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missile  has  accelerated  to  its  operating  velocity.  If  erne  insists  cm  a  rapid 
response  during  the  initial  acceleration  phase,  the  large  transient  pulse  of 
acceleration  in  the  wrong  direction  shown  in  Fig.  9.4-2(b)  is  physically 
unavoidable  because  of  the  required  control  surface  deflection.  To  avoid 
excessive*  control  surface  deflections  with  the  pole  assignment  technique 
the  command  input  signal  level  must  be  kept  low  during  this  period.  If 
nonuniform  response  characteristics  can  be  tolerated  during  periods  of  low 
dynamic  pressure  and  mach  number,  the  adaptive  optimal  control  techniques 
can  be  used  to  limit  control  surface  deflections.  However,  as  suggested  in 
Section  9.1,  the  latter  can  also  be  accomplished  by  the  pole  assignment 
method,  using  different  sets  of  poles  (i.e. ,  adaptive  closed  loop  poles)  for 
different  ranges  of  flight  conditions.  This  seems  to  be  the  preferred  pro¬ 
cedure  in  view  of  the  fact  that  the  optimal  methods  require  more  computer 
memory  and  more  on-line  computation  for  each  recalculation  of  the  adap¬ 
tive  gains. 


The  above  observations  are  summarized  in  Table  9.4.1.  Because 
the  pole  assignment  technique  has  relatively  low  computational  requirements 
it  is  considered  to  be  the  most  promising.  As  suggested  above,  any  require¬ 
ment  for  variations  in  autopilot  response  characteristics  can  be  accommodated 
by  determining  a  few  sets  of  closed  loop  poles,  p—,  i  =  1, . .  ,M,  which  are 
suitable  for  different  flight  regimes.  The  feedback  gains  are  calculated 
adaptively  according  to  Eq.  (9. 1-9).  It  is  anticipated  that  only  a  few  dif¬ 
ferent  sets  of  poles  will  be  required  in  view  of  the  generally  good  perform¬ 
ance  exhibited  by  the  pole  assignment  method  in  Fig.  9.4-1. 


* 

In  a  practical  situation  there  are  limits  on  the  allowable  control  surface 
deflection  which  the  linear  control  system  design  techniques  considered 
here  do  not  take  into  account. 
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TABLE  9.4-1 

COMPARISON  OF  ADAPTIVE  CONTROL  TECHNIQUES 
REQUIRING  EXPLICIT  PLANT  IDENTIFICATION 


PERFORMANCE  CRITERIA 

MOST 

SATISFACTORY 

LEAST 

SATISFACTORY 

Computational 

Complexity 

Programming  Instructions, 
Storage,  Amount  of  Computation 
per  Gain  Updating 

Pole 

Assignment 

Optimal  Model 
Following  System 

Number  of  Adaptive 

Gain  Updatings 

Per  Unit  Time 

Both  Optimal 
Systems 

Pole 

Assignment 

Uniform  Response  Characteristics 

Pole 

Assignment 

Optimal 

Regulator 

Control  Level  Required 

Both  Optimal 
Systems 

Pole  Assignment 
(Low  Dynamic 
Pressure  and  Low 
Mach  Number) 

Optimal  control  techniques  may  be  useful  as  an  off-line  design 
aid  to  determine  suitable  values  for  the  closed  loop  poles.  However, 
because  the  closed  loop  dynamics  of  adaptive  optimal  systems  vary  in  an 
unknown  fashion  as  plant  parameters  change,  they  are  not  as  suitable  for 
obtaining  particular  response  characteristics  as  the  pole  assignment 
method.  The  added  computational  capability  required  to  implement  adaptive 
optimal  methods  is  better  devoted  to  estimating  plant  parameters  as 
accurately  as  possible  via  the  methods  discussed  in  Chapter  6. 
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S.  5  RAPIDLY  VARYING  PLANT  PARAMETERS 

In  the  preceding  sections  of  this  chapter,  adaptive  control 
techniques  have  been  applied  to  a  situation  where  plant  parameters  vary 
relatively  slowly  with  respect  to  the  response  time  of  the  closed  loop  con¬ 
trol  system.  When  missile  flight  conditions  are  changing  very  rapidly,  as 
in  dogfight  applications,  there  is  little  theoretical  justification  for  updating 
feedback  gains  by  successively  solving  for  the  optimal  control  which  mini¬ 
mizes  the  performance  index  in  Eq.  (S.2-2)  or  by  determining  those  gains 
which  maintain  constant  instantaneous  closed  loop  poles  as  in  Eq.  (9.1-4). 
Nevertheless,  these  methods  may  still  give  satisfactory  performance  if  the 
feedback  gains  are  recomputed  sufficiently  often.  In  this  section  we 
demonstrate  the  autopilot  acceleration  response  achieved  by  the  pole  assign¬ 
ment  technique  for  the  three  second  thrusting  period  of  the  trajectory  given 
in  Section  H.l,  during  which  time  the  missile’s  dynamic  pressure  changes 
by  a  factor  of  almost  twenty. 

Two  sets  of  closed-loop  poles  are  selected  for  the  airframe: 


'-50.0 

'-50.0 

S' 

-  2.8  +  j 

2.8  -  j 

4.3 

4.3 

;  s 

-  5.3  i-  j  5.1 
.-  5.3- j  5. 1 

The  set  is  used  in  the  interval  4  st<6  and  Pg  in  the  interval 
6  s:  t  s  9.  This  design  permits  controlled  variation  in  the  response  charac¬ 
teristics  to  allow  for  the  changing  effectiveness  of  aerodynamic  control  sur¬ 
faces.  The  feedback  gains  h  and  the  d-c  gain  k  are  computed  forty  times  per 
second*  from  Eqs.  (9. 1-9),  (9. 1-5),  and (9. 1-7)  with  am,  bm  arid  cm  deter¬ 
mined  by  equating  coefficients  in  the  expression 
* 

Satisfactory  behavior  was  also  observed  with  twenty  gain  recomputations 
per  second;  however  ten  per  second  led  to  instability. 
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s2  +  ams2  +  bms  +  cm  =  (s  *  Pil)(s  '  Pi2)(s  '  P13) 


.th 


where  p. .  is  the  j  element  of  p^. .  A  piecewise  constant  input  command 

J 


v(t)  =  ±  16.0  ft/sec2 


was  applied,  with  the  sign  switched  at  one  second  intervals. 


The  acceleration  response  and  the  control  surface  deflection  for 
the  airframe  control  system  are  displayed  in  Fig.  9.5-1.  Control  surface 
deflection  is  plotted  in  terms  of  the  normalized  quantity* 

6  =  1 6 1 1//2  sign  (6) 

The  curves  indicate  that  the  acceleration  settles  close  to  the  commanded 
value  in  an  interval  equal  to  about  three  of  the  time  constants  associated 
with  the  dominant  complex  poles  of  p^  and  gmg.  Durin?  the  first  part  of 
the  trajectory,  a  large  transient  acceleration  having  an  algebraic  sign 
opposite  to  that  of  v(t)  is  produced  by  the  control  surface.  This  is  another 
example  of  the  behavior  observed  in  Figs.  9.4-2(b)  and  (c).  For  this 
simulation  the  peak  deflection  required  in  Fig.  9. 5-l(b)  far  exceeds  the 
linear  range  of  any  missile  control  surface.  Therefore  in  order  to  actually 
achieve  the  acceleration  response  time  shown  in  the  interval  4.0  s  t  s  6.0, 
a  much  lower  level  of  v(t)  is  required.  Thus  one  concludes  that  this  par¬ 
ticular  missile  has  little  capability  for  following  steering  commands  during 
the  first  part  of  its  thrusting  phase. 


jjc 

This  normalization  is  defined  to  provide  a  convenient  scale 
in  Fig.  9.5-1, 
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|b)  NORMALIZED  CONTROL  SURFACE  DEFLECTION 

Figure  9. 5-1  Response  Characteristics  While  Airframe 
Parameters  are  Varying  Rapidly:  Autopilot 
Designed  Using  Pole  Assignment  Technique 
With  Adaptive  Closed  Loop  Poles 

A  more  positive  conclusion  about  the  above  simulation  is  that  the 
airframe  is  successfully  stabilized  by  the  pole  assignment  method  if  the 
feedback  gains  are  computed  sufficiently  rapidly  and  if  the  called  for  con¬ 
trol  levels  are  within  the  missile's  capability.  In  other  words,  the 
instantaneous  autopilot  closed  loop  poles  provide  a  good  indication  of  the 
response  characteristics,  even  though  the  airframe  parameters  vary  a 
large  amount  in  a  period  that  is  short  with  respect  to  the  desired  autopilot 
response  time.  Furthermore,  these  results  have  a  strong  bearing  on  the 
parameter  identification  procedure  used  to  obtain  a.  In  this  particular 
thrusting  missile,  parameter  estimates  must  be  obtained  at  a  rapid  rate  — 
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on  the  order  of  twenty  times  per  second.  Further  investigation  is  needed 
to  determine  which  identification  technique  is  best  stated  to  meet  this 
requirement.  We  have  pointed  out  that  the  basic  parameter  identification 
technique  described  in  Section  6. 3  has  the  potential  for  generating  esti¬ 
mates  very  rapidly. 


9.6  SUMMARY  AND  CONCLUSIONS 

Three  methods  of  adaptive  control  which  utilize  explicit  esti¬ 
mates  of  plant  parameters  have  been  evaluated  for  a  missile  autopilot. 
Each  one  operates  on  the  principle  of  updating  a  set  of  feedback  gains, 
assuming  that  the  plant  parameters  remain  constant  at  their  estimated 
values  for  all  future  time,  so  that  the  instantaneous  system  closed  loop 
poles  provide  an  indication  of  the  airframe  response  characteristics.  The 
most  promising  method  --  both  in  terms  of  mechanization  and  ability  to 
provide  uniform  response  characteristics  --is  the  pole  assignment 
technique. 


Each  adaptive  method  is  theoretically  justifiable  so  long  as  plant 
parameters  vary  slowly  compared  with  the  desired  airframe  response.  In 
a  dogfight  application  this  condition  does  not  hold;  however,  the  simulation 
results  in  Section  9.5  indicate  that  the  pole  assignment  technique  can  still 
provide  a  stable  autopilot,  within  the  capability  of  the  control  surfaces.  At 
low  velocities,  moderate  acceleration  steering  commands  and  a  rapid 
autopilot  response  can  require  control  levels  that  exceed  the  bounds  im¬ 
posed  by  constraints  on  aerodynamic  control  surface  deflections.  To 
improve  the  control  capability  at  low  velocities,  one  of  the  alternative  mis¬ 
sile  designs  discussed  in  Section  8.4.2  may  be  attractive. 
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An  important  topic  for  additional  study  is  an  evaluation  of 
parameter  identification  techniques  for  missile  applications  including  the 
effects  of  noisy  measurements.  There  is  a  special  need  to  determine 
which  methods  can  accomplish  the  extremely  rapid  parameter  estimation 
required  for  thrusting  missiles.  In  addition,  it  is  worthwhile  considering 
how  to  effectively  utilize  a  priori  information  that  may  be  available  --  such 
as  a  known  thrust  level,  a  known  launch  altitude,  etc.  --  to  determine  air¬ 
frame  parameters.  These  topics  are  beyond  the  scope  of  this  report  but 
they  should  be  the  subject  of  future  investigation. 
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10.  APPLICATIONS:  LOW  SENSITIVITY  CONTROL  SYSTEMS 


Lx  Chapter  7  a  number  of  techniques  useful  for  designing  control 
systems  that  are  insensitive  to  plant  operating  conditions  have  been  des¬ 
cribed.  Those  methods  which  have  the  capability  to  desensitize  the  system 
to  wide  ranges  of  parameter  variations  are  best  suited  for  tactical  missiles. 
Li  this  chapter  one  such  design  procedure  --  the  Liapunov  synthesis  tech¬ 
nique  described  in  Section  7.4  --  is  evaluated  for  a  pitch  rate  autopilot  ha\  - 
in g  time -varying  airframe  dynamics.  It  is  demonstrated  that  this  method 
has  the  capability  to  null  the  difference  between  the  outputs  of  the  autopilot 
and  a  reference  model. 


10.1  DESIGN  PROCEDURE 

As  described  in  Section  7.4,  the  Liapunov  synthesis  technique  for 
designing  an  insensitive  control  system  is  similar  to  the  Liapunov  synthesis 
technique  for  an  adaptive  controller  developed  in  Section  4.4. 3.  In  fact  the 
design  steps  are  identical  up  to  the  point  of  defining  a  Liapunov  function  and 
selecting  the  control  law.  Consequently  this  discussion  of  an  insensitive 
pitch  rate  autopilot  parallels  the  first  portion  of  Section  8.3. 1;  the  latter 
material  is  repeated  here  for  completeness. 

Let  the  input-output  relations  for  the  design  problem  be  repre¬ 
sented  by  Fig.  10. 1-1  in  analogy  with  Fig.  8. 3-1.  To  design  an  insensi¬ 
tive  nonadaptive  controller  we  proceed  as  in  Eqs.  (8.3-1)  through  (8.3-7). 

In  Laplace  transform  notation  the  equations  of  motion  for  the  plant  and  the 
model  are 
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Now  we  divide  both  sides  of  Eq.  (10.1-2)  by  the  polynomial 

Pc(s)  =  s  -  pt 

where  p^  is  a  negative  number  whose  allowed  range  of  values  is  to  be 
determined  presently.  The  result,  after  carrying  out  the  required  number 
of  steps  in  dividing  by  p  (s),  is 

V 


s+bm+pi)E(s)  = 


E<s)  +  (bm-b)Y(s) 


a  -a+p.,(b  -b) 

+  B — L  Y(s)  +  kU(s) 

s-pj 


+  U(s)  -  -kmV(s)  (10.1-3) 


which  has  the  same  form  as  Eq.  (7.4-5).  Now  for  stability  of  the  error 
signal  we  require  that 


b  +  p,  >  0 
m  1 


(Hi  the  left-hand-side  of  Eq.  (10.1-3),  or 


P-.  >  -b 
*1  m 


(10.1-4) 


This  provides  the  condition  needed  on  p  (s). 

V 


Referring  to  the  right-hand  side  of  Eq.  (10.1-3)  define  new 


variables 
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(10.1-5) 


(10.1-6) 


Rewriting  Eq.  (10.1-3)  in  the  time  domain  with  the  aid  of  Eqs.  (10. 1-5)  and 
(10.1-6),  produces  a  differential  equation  for  the  error  in  the  form  of 
Eq.  (7.4-15), 

6(t)  =  -  (bm  +pj)  e(t)  +  £Tf(t)  +  ku(t)  -kmv(t)  (10.1-7) 

At  this  point  the  design  procedure  departs  from  that  developed 
in  Section  8.3  and  continues  as  in  Section  7.4,  beginning  with  Eq.  (7.4-16). 
A  Liapunov  function, 

V(e)  =  ie2  (10.1-8) 

is  defined.  Differentiating  V  and  substituting  for  e(t)  from  Eq.  (10. 1-7) 
produces 


Y»  =  2®- 


s-p. 


E  (s)  =  -MfeL 
C  s-pj 


uc(s) 


=  JSsL 


s-p, 


V  (s)  -  -SSL 

c  S-Pj 


and  vector  quantities 


b  -b 
m 

am-a+pl(bm-b) 

Px-z 


m 


(zm~pl) 


-am-pl(bm+pl) 


’  y(t)  ‘ 

yc(*) 

;  f(t)  = 

uc(t) 

vc(t) 

ec(t) 
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^(t)  =  -  (bm  +P1)e(t)2  +  [joTf(t) +ku(t)  -kmv(t)j  e(t)  (10.1-9) 

To  design  a  controller  that  is  insensitive  to  variations  in  £  and  k,  choose 
the  control  u(t)  in  the  form  of  Eq.  (7. 4-24), 


where  I  1  denotes  the  maximum  value  of  the  argument  over  the  range  of 
max 

allowable  parameter  variations.  Observe  that  in  addition  to  the  ranges  of 
parameter  values,  the  sign  of  k  must  also  be  known  in  order  to  mechanize 
Eq.  (10.1-10).  Substitution  for  u(t)  from  Eq.  (10. 1-10)  into  Eq.  (10.1-9) 
yields  the  following  inequality  for  V{tj: 

V(t)  *  -  (bm  +  PtL)  eft)2;  |e(t)l  >  «  (10.1-11) 

Using  the  definition  of  V  in  Eq.  (i0. 1-8),  the  above  inequality  can  be 
rewritten  as 

V(t)  s-2  (bm+P1)v(t);  |e(t)|  >  t 

Therefore  it  is  inferred  that 

V(t)  s  V(0)  exp  j^-2  (bjn  +  pjt  ;  eft)  >  e  (10.1-12) 
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or,  equivalently 

e(t)2  s;  eCofexp  j^-2  (bm+p1^tj ;  je(t)|  >  c  (10.1-13) 

Consequently,  by  choosing  p^  in  Eq.  (10. 1-3)  so  that  the  quantity 
(bm+pi )  is  positive  (as  already  required  by  Eq.  (10. 1-4))  and  with  know¬ 
ledge  of  the  ranges  of  plant  parameter  variations  and  the  algebraic  sign  of 
the  plant  gain  k,  a  controller  can  be  designed  which  forces  e(t)  to  the  set 
of  values, 

!  e(t)  |  s  c  (10.1-14) 

exponentially.  Furthermore  if  the  output  error  is  initially  Tass  than  c, 

Eq.  (10  1-14)  is  always  satisfied;  that  is,  the  error  always  remains  below 
the  saturation  level.  A  block  diagram  of  the  control  system  is  shown  in 
Fig.  10.1-2.  The  choice  of  e,  which  specifies  the  saturation  character¬ 
istics  defined  in  Eq.  (10.1-10),  is  determined  by  jointly  considering  the 
effects  of  system  bandwidth  and  the  error  bound  provided  by  Eq.  (10.1-14). 
The  former  increases  and  the  latter  decreases  as  e  decreases.  In 
practice  it  is  found  that  the  error  remains  significantly  less  than  e  because 
the  nonlinear  gain  term  (in  brackets)  in  Eq.  (10.1-10)  is  usually  conserva¬ 
tively  large.  This  behavior  is  subsequently  demonstrated  in  simulation 
results. 

Before  presenting  simulation  results  it  is  worthwhile  interpreting 
Eq.  (10.1-10)  in  a  way  that  leads  to  a  less  complicated,  more  familiar  type 
of  control  law.  The  expression  for  u(t)  can  be  viewed  simply  as  a  nonlinear 
gain  multiplying  a  nonlinear  function  of  the  error,  viz. , 
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/  1H? 


Figure  10.1-2  Mechanization  of  Insensitive  Controller 

Based  on  Liapunov  Design  Procedure 


u(t)  =  -D  (f(t),  v(t))  sat(e(t)) 


D(f(t),v(t)) 


5 

h 

k 

k 

- 

E 

i  =  1 

h(t)|+ 

m 

k 

(V{t)l 

max 

max 

m 

sign(k)  (10.1-15) 


Disregarding  the  fact  that  D  is  a  nonlinear  function  of  the  input  and  various 
system  state  variables,  Fig.  10.1-2  can  be  redrawn  much  more  simply  as 
shown  in  Fig.  10.1-3.  That  is,  the  plant  input  u(t)  is  generated  by  passing 
the  output  error  through  a  large  saturating  gain  with  drive  (saturation)  level 
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K-iSS! 


Figure  10. 1-3  Alternative  Representation  for  Fig.  10.1-2 


D(f(t),  v(t)).  Now  the  purpose  of  D  in  Eq.  (10.1-15)  is  to  make  the 
second  term  on  the  right  side  of  Eq.  (10.1-9)  sufficiently  negative 
so  that  Eq.  (10.1-11)  holds.  If  it  is  conjectured  that  \ L(t) |  and  |v(t)| 
always  remain  within  known  bounds,  then  it  is  possible  to  choose  D 
constant  and  still  obtain  the  desired  stability  properties.  Such  a  con¬ 
trol  law  is  considerably  simpler  to  mechanize  than  that  given  in  Eq. 
(10.1-10).  The  performance  of  the  system  with  a  fixed  value  of  D  is 
compared  with  the  control  law  in  Eq.  (10. 1-15)  in  the  simulation  results 
reported  below. 


10.2  SELECTION  OF  PARAMETERS 

For  simulation  purposes  the  airframe  dynamics  are  taken  from 
the  trajectory  data  in  Section  H.  1  with  the  actuator  dynamics  neglected. 
The  dynamics  of  the  reference  model  are  taken  from  flight  condition  6  in 
Table  H.  3.  To  implement  the  control  law  in  Eq.  (10. 1-10),  values  for  the 
quantities  | p./kj  are  required.  Referring  to  Eq.  (10. 1-6)  and  the 
flight  data  in  Table  K.  3  it  follows  that 
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f> 2 

k 

p3 

k 


p4 


p5 

k 


max 


max 


max 


b  -b 
m 


max 

*m-a+pl(bm-b) 


max 


Pj-z 


max 


max 


m 


MO 


max 


max 


■vpiMi) 


a  1.5 


3.0 


^  3.0 


^  17.0 


—  3.0 


max 


m 

k 


-  9.0 


max 


In  addition,  the  saturation  parameter  €  in  Eq.  (10.1-10)  is  assigned  the 
value  0.01. 


10.3  PERFORMANCE  EVALUATION 

To  demonstrate  the  operation  of  the  pitch  rate  control  system 
illustrated  in  Fig.  10.1-2,  a  simulation  of  the  autopilot  was  conducted  with 
the  aii-frame  parameters  fixed  at  values  specified  by  flight  condition  2  in 
Table  H.l.  In  Fig.  10.3-1  the  reference  model  response  and  the  output 
error  are  plotted  for  a  step  injaut  command.  Except  for  the  initial 
transient,  the  error  is  less  than  2  percent  of  the  model  output,,  Further¬ 
more  the  magnitude  of  the  error  is  always  considerably  less  than  the 
theoretical  upper  bound  of  0.01  specified  by  Eq.  (10. 1-14).  This  is  to  be 
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Figure  10.3-1  Step  Response  for  the  Pitch  Rate  Autopilot 

With  a  Time-Invariant  Plant 

expected  because  the  nonlinear  gain  D  in  Eq.  (10. 1-15)  is  conservatively 
large  most  of  the  time. 

The  behavior  of  this  autopilot  when  parameters  are  time  varying 
is  illustrated  in  Fig.  10.3-2  for  the  first  two  seconds  of  thrusting  flight 
(Table  H.l),  beginning  in  flight  condition  1  at  t  =  5.0  seconds.  Both  the 
pitch  rate  response  of  the  reference  model  and  the  output  error  are  shown. 
The  latter  is  on  the  order  of  1%  of  the  former  after  an  initial  transient. 
Evidently  the  control  system  is  quite  capable  of  keeping  the  error  small 
along  such  a  trajectory. 

The  above  performance  compares  favorably  with  that  shown  in 
Fig.  8. 3-7  for  the  Liapunov  adaptive  design.  However,  against  this  ad¬ 
vantage  must  be  weighed  the  fact  that  Fig.  10.1-2  is  basically  a  high  gain 
design,  as  indicated  in  Fig.  10.1-3.  Consequently  higher  order  modes  such 
as  airfrarae  structural  vibrations  may  be  excited  and  greater  control  levels 
are  generally  required. 
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/  lit] 


(a)  REFERENCE  MODEL  PITCH  RATE  OUTPUT 
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Figure  10.3-2  Step  Response  for  the  Pitch  Rate  Autopilot 

With  a  Time-Varying  Plant 
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To  compare  the  system  in  Fig.  10.1-2  with  the  case  where  the 
drive  level  of  the  nonlinearity  in  Fig.  10.1-3  is  fixed,  simulations  were 
performed  with  D  in  Eq.  (10.1-15)  set  equal  to  a  constant.  The  results  for 
two  levels  of  D  are  presented  in  Fig.  10.3-3  for  flight  condition  2  in 
Table  H.  1;  the  airframe  parameters  and  the  input  command  have  the  same 
values  as  those  used  to  generate  Fig.  10. 3-1.  Evidently  the  use  of  a  suf¬ 
ficiently  large  constant  value  of  D  gives  performance  which  is  comparable 
to  the  control  law  in  Eq.  (10.1-10).  In  these  simulations  the  error  never 
exceeds  the  linear  range  of  the  function  sat  (e(t))  so  the  system  operation 
remains  linear.  This  controller  configuration  is  much  simpler  to  imple¬ 
ment  than  that  in  Fig.  10. 1-2;  however  the  latter  may  have  some  opera¬ 
tional  advantage  in  that  the  nonlinear  gain,  D/ e,  becomes  small  when  the 
signals  in  the  system  are  small  (see  Eq.  (10.1-15)),  requiring  a  smaller 
control  level. 


Figure  10. 3-3  Autopilot  Response  for  the  Pitch  Rate  Autopilot 

With  a  Time -Invariant  Plant  and  Constant 
Drive  Level  D 
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10.4  SUMMARY  AND  CONCLUSIONS 

The  essential  feature  of  the  insensitive  controller  described  in 
this  chapter  is  that  the  system  output  error  is  passed  through  a  saturating 
high  gain  element  having  a  saturation  level  D  that  is  a  nonlinear  function  of 
variables  generated  by  the  plant  and  by  various  compensation  networks. 
The  system  can  be  represented  by  Fig.  10. 1-3  where  D  varies  as  pre¬ 
scribed  in  Eq.  (10. 1-15).  Consequently,  for  an  error  signal  that  satisfies 
Eq.  (10.1-14),  the  compensation  operates  essentially  as  a  high  gain  with 
value  D/e. 


The  use  of  high  gain  controllers  is  common  practice  for  desen¬ 
sitizing  a  system  to  plant  parameter  variations,  particularly  for  pitch  rate 
and  roll  rate  autopilots  in  aircraft  and  missile  applications.  As  noted  in 
Section  4.5,  such  techniques  applied  to  autopilot  design  may  be  excessively 
sensitive  to  noise  and  may  excite  structural  bending  modes.  The  main  con¬ 
tribution  of  the  design  approach  evaluated  here  is  that  it  generates  the  gain 
factor  D  as  a  nonlinear  function  of  system  variables  such  that  the  system 
has  desired  stability  properties.  Furthermore,  if  the  signal  levels  are 
known  to  be  bounded,  the  theory  extends  to  the  case  where  D  is  a  suffi¬ 
ciently  large  constant. 

As  noted  in  Section  7.4  this  design  method  is  not  suitable  for 
nonminimum  phase  plants,  for  essentially  the  same  reason  given  in 
Section  4.4.4.  To  make  the  output  error  small,  the  plant  input  u(t)  must 
cancel  the  effect  of  any  right-half-plane  zeros,  tending  to  make  the  system 
unstable.  Consequently  this  technique  is  not  directly  applicable  for 
achieving  good  normal  acceleration  response  in  tail-controlled  missiles 
having  fixed  wings.  However,  using  the  artifice  of  an  adaptive  reference 
model  in  the  same  fashion  described  in  Section  8.2.4,  the  system  in 
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Figure  10.  1-3  can  be  modified  to  provide  adaptive  normal  acceleration 
response.  This  technique  requires  that  enough  plant  parameters  be  iden¬ 
tified  so  that  the  corresponding  parameters  in  the  reference  model  can  be 
adjusted  adaptively  to  yield  a  model  pitch  rate  transfer  function  that  cor¬ 
responds  to  the  desired  normal  acceleration  response.  For  the  applica¬ 
tion  considered  in  this  chapter,  only  a  single  plant  zero,  z,  and  the  mis¬ 
sile  airspeed  V  need  to  be  identified;  the  modification  required  to  the  con¬ 
trol  system  in  Fig.  10.1-3  is  shown  in  Fig.  10.4-1.  No  simulation  of 
this  method  is  presented  here. 

*-2735 

AVAILABLE 


Figure  10.4-1  Adaptive  Reference  Model  Configuration 
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11.  GUIDANCE  SYSTEMS  FOR  TACTICAL  MISSILES 


The  objective  of  guidance  is  to  prescribe  a  steering  law  for  the 
missile  that  will  transfer  it  from  the  launcher  to  a  target.  To  accomplish 
this  task,  different  guidance  laws  may  be  used  along  different  portions  of 
the  missile’s  trajectory,  depending  upon  the  information  available,  the 
maneuvers  required,  and  the  control  mechanisms  in  use.  For  this  dis¬ 
cussion  it  is  convenient  to  think  of  three  distinctive  guidance  stages  which 
can  exist  in  a  tactical  mission;  these  are  the  post-launch,  midcourse,  and 
homing  phases. 

Post-launch  guidance  is  concerned  with  the  missile  trajectory 
immediately  after  separation  from  the  launcher.  This  stage  includes  any 
special  maneuvers  required  (such  as  a  180  degree  turn)  to  head  the  mis¬ 
sile  in  the  general  direction  of  the  target  or  to  orient  sensors  so  that  they 
can  acquire  the  target.  For  example,  steering  commands  can  be  pro¬ 
grammed  in  open  loop  fashion  or  provided  by  the  launch  vehicle  during  this 
phase  of  the  trajectory. 

Midcourse  guidance  is  usually  employed  in  a  relatively  long 
range  mission.  A  midcourse  guidance  law  is  used  to  direct  the  missile  to 
a  region  near  the  target,  within  which  a  homing  sensor  can  provide  accu¬ 
rate  target  information.  An  important  requirement  of  this  phase  is  a 
means  for  providing  knowledga  of  missile  position  (e.g. ,  inertial  navigator, 
radio  tracking,  etc. )  enroute  to  the  target  area.  Midcourse  guidance  must 
be  sufficiently  accurate  to  enable  the  homing  sensor  to  acquire  the  target. 

Homing  guidance  is  usually  used  during  the  final  phase  of  mis¬ 
sile  flight.  In  a  sense,  this  is  the  most  critical  period  because  steering 
actions  taken  during  the  last  few  seconds  of  flight  have  the  most  effect  on 
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terminal  miss  distance.  Overall  mission  success  strongly  depends  upon 
having  a  homing  sensor  which  provides  accurate  information  about  the 
target’s  location  relative  to  the  missile,  and  a  steering  law  that  is  capable 
of  achieving  intercept  in  the  presence  of  target  maneuvers  and  measure¬ 
ment  errors.  Only  the  homing  phase  is  treated  in  this  chapter. 

The  chapter  begins  with  a  review  of  classical  homr  3  guidance 
techniques.  The  remainder  concentrates  on  analyzing  the  accuracy  of 
several  homing  guidance  laws  derived  using  techniques  of  optimal  control 
theory  to  account  for  constant  target  maneuvers  and  autopilot  dynamics. 
Graphs  relating  sensitivity  of  terminal  accuracy  and  control  effort  expended 
to  initial  condition  and  measurement  bias  errors  are  presented.  These 
curves  aid  in  judging  the  relative  performance  capability  of  various  designs; 
in  particular  they  provide  a  quantitative  comparison  between  optimal  and 
suboptimal  control  techniques.  The  analysis  presented  here  is  essentially 
deterministic;  it  does  not  include  measurement  noise  and  randomly  varying 
target  maneuvers  in  the  mathematical  model  of  the  guidance  problem.  As 
such,  it  provides  a  basis  for  making  preliminary  decisions  about  which 
guidance  law  is  most  appropriate.  For  a  specific  application  further  refine¬ 
ment  of  the  conclusions  obtained  here  should  be  make  by  investigating  these 
random  effects. 


11.1  GUIDANCE  EQUATIONS 

For  the  purpose  of  guidance,  the  missile  equations  of  motion  are 
those  which  represent  a  point  mass  moving  in  three  dimensions,  acted  upon 
by  aerodynamic,  gravitational,  and  thrust  forces.  To  take  the  effect  of  the 
missile’s  rotational  dynamics  into  account  assuming  the  autopilot  has 
already  been  designed,  the  airframe  response  is  represented  by  some  type 
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of  transfer  function  „  In  addition  to  missile  motion,  the  guidance  equations 
must  include  the  fact  that  the  target  is  maneuvering  (accelerating)  in  three 
dimensions.  If  accurate  mathematical  descriptions  of  all  these  factors  are 
employed  in  formulating  the  guidance  problem,  the  performance  analysis 
for  various  steering  laws  is  a  difficult  task.  However,  many  of  the  essen¬ 
tial  features  of  the  guidance  problem  can  be  studied  by  considering  only 
planar  motion  of  missile  and  target  and  by  neglecting  gravity  and  aero¬ 
dynamic  drag,  as  illustrated  in  Fig.  11 . 1-1.  Furthermore  it  is  assumed 
that  both  missile  and  target  velocities,  vm  and  v^,  are  constant  in  mag¬ 
nitude  with  variable  directions  that  are  controlled  by  lateral  vehicle 
acceleration  approximately  normal  to  the  velocity;  losses  of  missile  and 
target  airspeed  and/or  altitude  caused  by  maneuvers  are  neglected.  For 
an  aerodynamically  controlled  missile  the  above  conditions  imply  the 
vehicle  is  in  coasting  flight  with  control  forces  provided  by  its  lifting  sur¬ 
faces. 


Figure  11.1-1  Definitions  of  Guidance  Variables 
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are 


With  reference  to  Fig,  11,1-1  the  missile  equations  of  motion 


x  (t)  =  -  a  (t)  sin  0  (t) 
m  m  pm 


y  (t)  =  a  (t)  cos  0  (t) 
Jm  '  m  '  pmv  ’ 


a  (t)  =  -  a  a  (t)  +  a  a  (t) 
m  me 


a  (t)  =  vfl(t) 
m  mm 


(11.1-1) 


where  or  represents  the  pitch  autopilot  time  constant  associated  with 
achieving  a  particular  commanded  normal  acceleration  and  a  (t)  is  the 

V 

steering  command.  Similar  expressions  hold  for  the  target: 

xt(t)  =  at(t)  sin  /3t(t) 
yt(t)  =  at(t)  cos  0t(t) 

at(t)  =  vt4t(t)  (11.1-2) 

To  derive  a  feedback  steering  command  some  assumptions  are 
required  about  target  acceleration,  and  measurements  of  important  state 
variables  must  be  available.  In  addition  the  problem  can  be  simplified  if 
the  nonlinear  terms  in  Eqs.  (11.1-1)  and  (11.1-2)  are  removed  by  appro¬ 
priate  linearizations.  These  tasks  are  accomplished  in  various  ways, 
depending  upon  the  particular  guidance  method  used;  specific  details 
are  given  in  subsequent  sections  for  several  homing  guidance  tech¬ 
niques. 
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11.2  HOMING  GUIDANCE  TECHNIQUES 

A  review  of  possible  homing  guidance  methods  naturally  divides 
the  subject  into  two  categories  —  those  techniques  which  may  be  thought  of 
as  ’’classical”,  in  the  sense  that  they  predate  the  development  of  modem 
control  theory  and  those  referred  to  as  "optimal”  which  utilize  more  re¬ 
cently  developed  techniques  for  system  design.  In  this  section,  a  sum¬ 
mary  of  the  important  features  of  these  methods  is  presented. 


11.2.1  Classical  Homing  Guidance  Techniques 

Three  well  known  concepts  for  directing  a  missile  during  its 
homing  phase  are  pursuit,  beam  rider,  and  proportional  guidance  (Refs. 
133,  134,  135).  The  principles  of  each  of  these  techniques  are  described 
here  within  the  framework  of  Eqs.  (11. 1-1)  and  (11. 1-2). 

Pursuit  Guidance  —  One  of  the  first  ideas  for  guiding  an  inter¬ 
ceptor  vehicle  was  to  point  its  velocity  vector  directly  at  the  target.  The 
implementation  of  this  concept  is  the  essential  characteristic  of  pursuit 
guidance.  In  terms  of  the  variables  defined  in  Fig.  11. 1-1,  the  objective 
is  to  turn  the  missile’s  velocity  vector  v  m  so  that  it  lies  along  the  line  of 
sight  —  i.e.,  achieve  the  condition,  =  X.  The  method  of  steering  used 
to  accomplish  this  task  is  to  require  that  the  commanded  missile  accelera¬ 
tion  be  given  by 


ac(t)  =  -  k  ^9m(t)  -  X(tj) 
where  k  is  an  appropriate  constant  gain  (Ref.  19). 


(11.2-1) 
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1 


I 


An  indication  of  the  effectiveness  of  this  steering  command  is 
obtained  by  assuming  that  the  angles  0m(t)  and  X(t)  do  not  change  very 
much  during  the  homing  phase  and  that  their  difference  is  small  in  mag¬ 
nitude.  This  allows  the  x-y  coordinate  axes  in  Fig.  11.1-1  to  be  chosen 
so  that  both  angles  remain  small;  e.  g. ,  define  the  x  axis  to  be  along  the 
initial  line  of  sight.  Making  the  additional  approximation  that  the  effect 
of  autopilot  lag  is  negligible  in  Eq.  (11. 1-1)  and  using  Eq.  (11.2-1),  one 
has 


am(t)  *  ac(t)  =  -  k  (fijt)  -  m)  (11.2-2) 

The  above  assumptions  permit  our  writing  Eq.  (11.1-1)  as 


x  (t)  -  -  a  (t)  0  (t) 
m  cv  '  m  ' 

y m(t)  -  ac(t)  (11.2-3) 

To  relate  Eqs.  (11.2-2)  and  (11.2-3),  the  small  angle  approxi¬ 
mation  is  used  in  Fig.  11.1-1  to  write 


y  Jt) 

<Ut>  - 


m 


xj 
m 


yt(t)  ‘  ym(t) 

x<‘> *  feoj 

t  m 


(11.2-4) 


We  now  make  the  definitions 

y(  t)  =  /3m(t)  -  X(t)  r^^)  =  xt(t)  -  xm(t)  vc  =  -  fgQ(t)  =  constant 

(11.2-5) 
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where  r  (t)  is  the  range-to-go  and  v  is  the  closing  velocity.  Remember 
go  c 

that  che  objective  of  the  guidance  system  is  to  drive  y(t)  to  zero.  Differ¬ 
entiation  of  y(t)  in  Eq.  (11.2-5)  with  substitution  from  Eq.  (11.2-2)  through 
(11.2-5)  and  some  algebraic  manipulation  produces  a  differential  equation 


m 


y(t) 


yt(t)  xt(t) 
Fit)  +FltI 

go'  go' 


X(t) 


(11.2-6) 


where  the  approximation 


v  =  x  (t) 
m  mv  ' 


is  used,  consistent  with  the  small  angle  assumption. 


If  the  target  is  stationary,  ±^(t)  =  yt(t)  =  0.  The  range-to-go 
is  given  by 


(t) 


r  (0) 
go' 


t 

c  go 


where  t^  is  the  time-to-go  until  intercept.  Consequently  Eq.  (11. 2-6) 
becomes 


r(t)  --  \^f- - )  y(t)  (11.2-7) 

\  c  go  m  / 

In  this  expression  the  coefficient  of  >(t)  contains  a  positive  term  which 
approaches  infinity  as  t  approaches  0.  Therefore  the  differential  equa¬ 
tion  for  y(t)  becomes  unstable  just  before  intercept  and  y(t)  cannot  be 
nulled.  However,  in  practice  it  is  possible  to  choose  k  sufficiently  large 
so  that  the  coefficient  of  y(t)  in  Eq.  (11. 2-7)  is  negative  until  very  near 
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intercept,  yielding  an  acceptable  terminal  miss.  This  accounts  for  the 
satisfactory  performance  of  pursuit  guidance  against  surface  targets 
(Ref.  19). 


Against  high  velocity  moving  targets,  the  terms  involving  x^(t) 
and  yt(t)  in  Eq.  (11.2-8)  are  detrimental  to  the  objective  of  driving  y(t)  to 
zero.  In  addition,  the  path  followed  by  the  missile  is  not  very  efficient, 
as  indicated  in  Fig.  11.2-1.  For  this  reason  pursuit  guidance  is  not 
usually  considered  feasible  against  air  targets  (Ref.  23). 

Beam  Rider  Guidance  —  The  fundamental  idea  behind  beam  rider 
guidance  systems  is  that  the  target  is  tracked  by  some  type  of  active  trans¬ 
mitter  (e.g.,  a  radar)  external  to  the  missile,  often  located  in  the  launch 
vehicle  (Ref.  133).  The  situation  is  depicted  in  Fig.  11.2-2.  The  missile 
is  equipped  with  a  sensor  that  provides  a  measure  of  the  deviation  of  the 
missile  from  the  beam  centerline,  e.g.,  the  distance  x  in  Fig.  11.2-2. 

In  order  that  the  missile  be  directed  toward  the  center  of  the 
beam,  its  lateral  (commanded)  intertial*  acceleration,  z(t),  can  be  a  linear 
function  of  x(t)  and  x(t),  i.e., 


*(t)  =  -klX(t)  -k2x(t)  (11.2-8) 

In  terms  of  the  lateral  acceleration,  j?(t),  of  the  beam  we  have 

z(t)  =  X(t)+?(t)  (11.2-9) 


$ 

Inertial  acceleration  (exclusive  oi  the  effects  of  gravity)  is  the  quantity 
measured  by  the  missile’s  sensors.  Consequently  the  acceleration 
applied  to  control  the  missile  is  thought  of  aB  inertial  although  the 
quantity  of  interest  is  acceleration  with  respect  to  the  beam. 
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-MOST  EFFICIENT 
TRAJECTORY 
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VELOCITY  TARGET 


LINE-OF-SIGHT 

RAYS 


/ 


LAUNCH  POINT 


Figure  11.2-1  Pursuit  Guidance  Trajectory 

As  the  missile  approaches  the  target,  the  beam’s  acceleration  is  approxi¬ 
mately  equal  to  the  component  of  target  acceleration  normal  to  the  beam. 
If  the  latter  is  zero,  then 

m  -  o 

and  Eqs.  (11.2-8)  and  (11.2-9)  combine  to  yield 
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*•2032 


Figure  11.2-2  Beam  Rider  Guidance  Trajectory 


x(t)  kgiCt)  +  kjX(t)  =  0 


(11.2-10) 


Evidently  both  x(t)  and  x(t)  must  be  measured  in  order  to  have  a  damped 
response.  Furthermore  the  damping  must  be  sufficient  so  that  x(t)  is  suf¬ 
ficiently  close  to  zero  by  the  time  of  intercept.  As  in  the  case  of  pursuit 
guidance,  the  missile  does  not  follow  the  most  efficient  trajectory  for  a 
moving  target, 
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The  beam  rider  concept  has  the  advantage  that  the  missile  need 
not  have  its  own  automatic  target  tracking  mechanism,,  If  it  is  launched 
within  the  radar  beam,  it  is  relatively  easy  to  design  a  receiver  for  the 
missile  that  will  tend  to  maintain  a  small  distance  x  in  Fig„  11. 2-2.  On 
the  other  hand,  an  objection  for  some  applications  is  the  dependence  upon 
an  external  source  to  provide  the  beam.  It  is  often  desirable  to  have  a 
self-contained  system  which  frees  the  launcher  vehicle  from  further  guid¬ 
ance  responsibility  after  firing  the  missile. 

Proportional  Guidance  is  an  application  of  the  principle  that  a 
collision  course  between  two  moving  objects  is  one  for  which  they  approach 
each  other  at  a  constant  relative  bearing.  In  terms  of  Fig.  11.1-1  this 
means  that  the  angular  rate  of  the  line  of  sight  (LOS)  is  to  be  held  at  zero, 

X(t)  =  0 

With  reference  to  Fig.  11.2-3,  this  condition  becomes 

<p(t)  +  $<t)  =  0 

The  actual  values  of  <p(t)  and  0(t)  can  be  measured  with  sensors  to  provide 
a  measure  of  the  deviation  of  line-of-sight  rate  from  zero. 

Assuming  that  this  guidance  scheme  works  well,  the  line-of- 
sight  should  not  rotate  very  much  along  the  intercept  trajectory;  hence  for 
analysis  purposes  it  is  useful  to  define  the  relative  coordinate  system  in 
Fig.  11.2-4.  The  x-axis  of  this  coordinate  frame  is  the  LOS  at  the  initial 
time,  t  =  0,  and  the  rotation  rate  of  the  LOS  is  given  by  X(t).  Distance 
along  the  x  axis  is  referred  to  as  range-to-go,  r  .  The  steering  law  for 
proportional  guidance  dictates  that  an  acceleration  be  applied  normal  to 
the  LOS  and  proportional  to  X(t)  in  such  a  way  as  to  reduce  |  X(t)  | ;  the  con¬ 
ventional  form  of  this  relation  is  (Ref.  25) 
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K~tOS3 


INTERCEPT 

POINT 


Figure  11.2-3  Proportional  Guidance 


y(t)  =  -T?vc\(t)  (11.2-11) 


The  quantity  r\  is  a  constant  of  proportionality  and  v  is  the  closing  velocity 

c 

(assumed  constant  in  our  analysis). 


(t) 


The  performance  of  this  guidance  law  is  ascertained  by  noting 
that  for  small  X(t), 
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*-7034 


Figure  11.2-4  Relative  Coordinate  System  in 

Proportional  Guidance 


X(t)  = 


JSL 

rg^ 


(11.2-12).. 


Repeated  differentiation  of  this  expression  with  substitution  from  Eq. 
(11.2-11)  yields 


X(t)  =  -X(t) 


(11.2-13) 


where 


t 

go 


r  (0) 

-Ell-t; 


v 


0  <  t  < 


r 


The  solution  to  Eq.  (11.2-13)  is 
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X(t)  =  X(0) 


(11.2-14) 


Evidently  X(t)  approaches  zero  with  t 


go 


if 


tj  >  2 


The  conclusion  is  that  if  the  constant  of  proportionality  in  Eq. 
(11.2-11)  is  large  enough,  this  steering  law  does  succeed  in  reducing  the 
LOS  rate  (and  hence  also  the  terminal  miss  distance)  to  zero  for  a  con¬ 
stant  velocity  target.  Moreover,  the  missile's  path  in  inertial  space  is 
nearly  a  straight  line,  which  is  the  most  efficient  (minimum- time)  tra¬ 
jectory  against  a  nonmaneuvering  target.  Because  of  its  ability  to  actually 
null  the  terminal  miss  and  the  simplicity  of  the  acceleration  command  in 
Eq.  (11.2-11),  proportional  guidance  has  been  favored  for  use  in  most 
missile  systems.  Under  the  assumptions  used  in  the  foregoing  analysis  — 
i.  e.,  a  constant  velocity  target  with  no  autopilot  lag  —  good  performance 
is  achieved.  However,  as  discussed  in  Chapter  3,  the  ability  of  a  target 
to  maneuver  and  the  presence  of  autopilot  dynamics  do  reduce  terminal 
accuracy.  This  raises  the  question  of  whether  such  effects  can  be  taken 
into  account  to  derive  a  better  steering  law.  In  the  next  section,  it  is 
demonstrated  that  optimal  control  theory  offers  one  means  to  this  end. 


11.2.2  Adaptive  Optimal  Guidance 

To  apply  optimal  control  techniques  to  the  design  of  missile 
guidance  systems,  the  problem  is  mathematically  described  in  terms  of 
a  set  of  first  order  differential  equations.  When  the  latter  are  linear  and 
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a  steering  policy  is  sought  which  minimizes  a  quadratic  performance  index, 
one  obtains  a  feedback  steering  law  that  is  a  linear  function  of  the  system 
state  variables.  In  this  manner  desired  performance  characteristics  can 
often  be  achieved  by  a  design  that  is  practical  to  implement  in  actual  appli¬ 
cations.  Several  optimal  linear  steering  laws  are  described  and  evaluated 
in  this  and  subsequent  sections.  The  problem  formulation  is  taken  from 
Ref.  (7). 


The  equations  of  motion  for  the  guidance  problem  are  obtained 
by  consideration  of  Fig.  11.2-4.  Just  as  in  proportional  guidance,  it  is 
assumed  that  the  line-of-sight  does  not  rotate  very  much  from  its  initial 
position  during  the  homing  phase.  The  x-axis  is  the  initial  LOS.  Because 
the  range  is  decreasing  at  a  relatively  uncontrollable  rate  (i.e.,  v  is 
nearly  constant),  the  principal  variables  of  interest  in  Fig.  11.2-4  are  y(t< 
and  its  time  derivatives. 

Problem  Statement  —  We  assume  that  the  result  of  a  positive 
command  to  the  missile  autopilot  is  acceleration  along  the  negative  y  axis. 
Actually,  the  autopilot  creates  an  acceleration  yector  that  is  approximately 
normal  to  the  missile’s  velocity  vector;  any  resulting  acceleration  com¬ 
ponent  along  the  line-of-sight  is  being  neglected.  The  autopilot  dynamics 
are  approximated  by  a  first  order  lag,  **  as  in  Eq.  (11. 1-1).  In  addition, 
we  allow  the  possibility  of  the  target's  having  a  constant  acceleration  at 
normal  to  the  LOS  along  the  positive  y-axis.  With  these  specifications,  a 
set  of  state  variables  cam  be  defined  as 


* 

It  is  tacticly  assumed  that  the  missile  has  no  capability  for 
controlling  its  longitudinal  thrust. 

In  Section  11.4  a  higher  order  model  is  investigated  which  includes 
the  right -half -plane  zero  associated  with  a  tail-controlled  missile. 
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Because  the  autopilot  dynamics  are  included  in  formulating  the  guidance 
problem  —  i.e.,  the  autopilot  is  predesigned  --  we  refer  to  this  as  a 
partially  coupled  set  of  guidance-autopilot  equations.  A  completely  coupled 
guidance-autopilot  steering  law,  which  incorporates  missile  airframe 
dynamics  within  the  mathematical  model,  is  investigated  in  Section  11.4. 


For  the  purpose  of  guidance,  it  is  desired  that  the  terminal  miss 
distance  Xj(T)be  made  small  within  the  capability  of  a  limited  amount  of  con¬ 
trol  ac(t).  If  unlimited  acceleration  were  available,  a  zero  miss  distance 
could  always  be  achieved  in  the  absence  of  measurement  errors.  These  con¬ 
siderations  motivate  the  following  choice  for  a  quadratic  performance  index; 


(11.2-17) 


where  r  is  a  positive  weighting  constant.  The  steering  law  is  to  be  chosen 
such  that  J  is  minimized  for  a  given  value  of  T.  The  latter  is  assumed 
known  from  the  relationship 
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T  = 


(0) 


v 


c 


(11.2-18) 


As  we  shall  see,  this  problem  formulation  requires  the  existence  of  a 
capability  for  measuring  range  and  range  rate  for  implementing  the  result¬ 
ing  steering  command. 


The  first  term  in  J  penalizes  large  values  of  terminal  miss.  The 
inclusion  of  the  integral  term, 


J 


u 


A 


in  the  performance  index  has  several  objectives  for  a  missile  guidance  ap¬ 
plication.  First  it  tends  to  limit  the  peak  normal  acceleration  output;  this 
feature  is  required  to  prevent  structural  failure  of  the  airframe.  In  addition, 
there  are  physical  constraints  on  the  maximum  magnitude  of  the  missile's 
control  surface  deflection.  The  latter  saturates  if  the  steering  command 
is  too  large  and  the  linear  character  of  the  autopilot  dynamics  is  destroyed. 
Consequently  the  limitation  on  a  (t)  provided  by  minimizing  J  tends  to  keep 

V 

the  control  surface  deflection  within  required  bounds.  Missile  maneuvers 
also  require  some  expenditure  of  energy  which  ultimately  results  either  in 
a  loss  of  altitude  or  a  loss  of  airspeed.  These  losses  increase  as  the  size 
of  the  integral  term  in  J  increases.  Other  energy  consumption  influenced 
by  Ju  is  that  required  to  drive  the  control  surfaces.  For  a  long  trajectory, 
the  continued  presence  of  changing  steering  commands  at  the  autopilot 
input  can  result  in  an  appreciable  drain  on  the  actuator  power  supply;  this 
effect  is  also  limited  by  limiting  the  magnitude  of  a_(t).  For  all  of  the 
above  reasons  the  integral  term  in  Eq.  (11.2-17)  at  least  qualitatively 
regulates  the  actual  control  signals  and  control  forces  employed  to  reduce 
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the  terminal  miss  distance.  However  it  is  emphasized  that  Ju  is  only  an 
indirect  measure  of  control  surface  deflection,  actual  normal  acceleration, 
and  energy  losses. 

Having  specified  a  performance  index,  all  of  the  remaining  sub¬ 
jectivity  in  the  design  problem  is  contained  in  the  parameter  r.  The  lar¬ 
ger  its  value,  the  heavier  the  penalty  on  the  control  relative  to  x^(T);  con¬ 
sequently  the  larger  is  the  terminal  miss.  It  should  be  chosen  to  provide 
an  acceptable  tradeoff  between  guidance  accuracy  and  control  effort. 

The  solution  to  the  optimal  control  problem  is  derived  analytically 
in  Ref.  7.  The  optimal  feedback  steering  law  is  given  by: 


5*  * 
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where  is  the  i^1  element  of  h.  The  feedback  gains  are  written  as  func¬ 
tions  of  time-to-go  rather  than  time.  Because  the  range  does  not  decrease 
exactly  linearly  with  time,  the  value  of  t  is  not  accurately  predictable 
along  the  trajectory.  However,  given  a  capability  for  measuring  range 
r^Q  and  range  rate  (-vc),  t  can  be  measured  continuously  and  adaptively 
adjusted  in  the  steering  law,  making  this  is  an  adaptive  optimal  guidance 
policy. 


Aside  from  requiring  some  computational  capability  to  imple¬ 
ment  the  steering  law,  the  above  guidance  policy  assumes  knowledge  of 
all  the  system  state  variables;  this  is  typical  of  feedback  solutions  to  opti¬ 
mal  control  problems.  At  first  sight,  this  requirement  appears  imprac¬ 
tical  because  some  of  the  states,  defined  in  Eq.  (11. 2-lb)  are  not  readily 
available.  In  particular,  x^t)  and  x2(t)  are  position  and  velocity  normal 
to  cn  initial  line-of-sight  reference  axis;  neither  of  these  quantities  is 
easily  measured.  This  situation  can  be  improved  if  we  differentiate  Eq. 
(11.2-12)  to  obtain 


X(t)  * 


y(t)  -  *3)  y(t) 


go 


r  (t)‘ 
gov 


(11.2-20) 


Substitution  for  r  (t)  and  f^Q(t)  from  Eqs.  (11.2-5)  and  (11. 2-19)  produces 


(11.2-21) 


Substitution  of  this  expression  into  Eq.  (11.2-19)  to  eliminate  x^(t)  and  Xg(t) 
yields  a  simpler,  approximate  steering  law 
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(11.2-22) 


The  resemblence  between  the  term  dependent  upon  X  and  the  steering  law 
for  proportional  guidance,  Eq.  (11.2-11),  is  noteworthy.  Its  significance 
will  be  more  evident  presently. 


Equation  (11.2-22)  is  more  easily  mechanized  than  Eq.  (11. 2-19) 
because  y(t)  and  y(t)  have  been  replaced  by  a  more  readily  measured 
variable,  the  line-of-sight  rate.  The  quantity  x^(t)  is  the  missile's  normal 
acceleration  which  can  be  measured  by  an  accelerometer.  The  target’s 
normal  acceleration  Xg(t)  can  be  inferred  by  differentiating  Eq.  (11.2-20) 
to  obtain 


X(t) 


2v 

■  X(t) 

go  go 


Using  ttie  fact  that 


y(t)  =  at  -  am(t) 


one  obtains 


a.  =  r — X(t)  -  2v„X(t)  +  a  (t) 


go 


m' 


(11.2-23) 


We  have  already  postulated  the  capability  to  measure  all  quantities  in  Eq. 
(11.2-23)  with  the  exception  of  X(t).  The  latter  can  be  obtained  by  differ¬ 
entiating  >{t).  The  need  to  obtain  angular  acceleration  of  the  line-of-sight 
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and  to  implement  Eq.  (11. 2-23)  makes  a^.  the  most  difficult  state  variable 
to  estimate . 


From  Eq.  (11.2-19)  cae  can  infer  other  optimal  steering  laws 
which  neglect  some  of  the  dynamics  in  Eq.  (11.2-16).  In  the  limit  of  no 
autopilot  lag  (a  approaches  infinity)  the  optimal  control  is 


a.  (t) 

"2 


f,2(‘go) 


t2 

go 


1 

—  0.5 
t 

go 


^(t) 

x2(t) 

X3(t) 


3t‘ 


_go_ 


3r+t' 


go 


and  use  of  Eq.  (11.2-21)  produces 


(11.2-24) 


(11.2-25) 


If  both  target  acceleration  and  autopilot  lag  are  neglected,  the 
optimal  steering  law  is 


Xj(t) 

x2(t) 


(11.2-26) 


Again,  substituting  from  Eq.  (11.2-21),  a  (t)  can  be  approximated  as 
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(11.2-27) 


It  is  interesting  to  see  what  happens  to  Eq.  (11.2-27)  as  the  weighting  r  in 
the  performance  index  goes  to  zero;  i.e.,  from  Eq.  (11.2-24)  it  follows  that 

lim  a  (t)  =  3v  X(t)  (11.2-28) 

r-0  c3  C 

The  right  hand  side  of  Eq.  (11. 2-28)  is  identical  with  the  form*  of  the  pro¬ 
portional  steering  law,  Eq.  (11.2-11)  with  7?  =  3.  In  words  Eq.  (11.2-28) 
states  that  the  classical  proportional  steering  law  with  rj  =  3  is  the  limit  of 
the  optimal  law  in  Eq.  (11.2-27)  as  the  weighting  on  the  control  in  the 
performance  index  approaches  zero. 


Steering  law  a  (t)  is  optimal  for  minimizing  J  in  Eq.  (11.2-17) 
C1 

subject  to  Eq.  (11.2-16);  ac^(t)  and  aC3(t)  are  optimal  when  the  equations 
of  motion  are  modified  to  eliminate  first  autopilot  lag  and  then  both  auto¬ 
pilot  lag  and  target  acceleration.  However,  we  are  concerned  with  the 
behavior  of  the  state  when  all  dynamics  are  present  (a  <  °°,  a^  i  0)  and 
either  aC2(t)  or  aCg(t)  is  used  in  place  of  the  optimal  control  because  the 
sensors  or  computation  equipment  required  to  implement  ac^(t)  may  not  be 
available.  Consequently  aC2(t)  and  acg(t)  are  referred  to  as  suboptimal 
steering  laws. 


Before  making  a  decision  to  implement  any  of  the  steering  laws 
derived  above,  one  needs  to  know  what  sort  of  performance  can  be  expected 
from  each  in  the  presence  of  all  the  effects  included  in  Eq.  (11.2-16)  and 
♦ 

The  minus  sign  is  missing  from  Eq.  (11.2-28)  because  of  the 
definition  of  positive  ac(t)  in  Eq.  (11.2-16). 
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also  the  influence  of  measurement  errors.  The  purpose  of  the  next  section 
is  to  present  a  sensitivity  analysis  which  determines  the  terminal  miss 
caused  by  state  variable  initial  conditions  and  measurement  error  biases 
for  a  range  of  values  of  the  weighting  parameter  r.  In  addition,  the 
measure  of  control  effort 


is  calculated  for  each  case  to  provide  an  indication  of  the  trade-off  between 
terminal  miss  achieved  and  effort  expended  for  a  particular  steering  strategy. 


11.3  STEERING  LAW  PERFORMANCE  ANALYSIS 

In  this  section  we  are  interested  in  quantitatively  evaluating  the 
terms  which  define  J  in  Eq.  (11.2-17)  for  different  linear  steering  laws 
applied  to  Eq.  (11.2-16).  In  particular,  the  equations  of  motion  have  the 
form 


where 


x(t)  =  Ax(t)  4  ba  (t);  i  =  1,2,3 

■  "  ““  C  • 

1 


a  (t)  =  -  h  (t)T  x(t) 
c.  -i 


(11.3-1) 


(11.3-2) 


andh.(t)  is  defined  as  follows  --  using  Eqs.  (11.2-19),  (11.2-24),  and 
(11.2-26)  and  noting  that  t  =  T  -  t: 

2TO 
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h^t)  =  h(T  - 1) 


h9(t)T  =  -T UT-t)  — 1 

L(T-i 


_  _L  I  o 

T-t  2  U 


h  (t)T  ^  -  r?2(T-t)  1  o  0 

3  2  L(T-t)2  T_t 


(11.3-3) 


Therefore,  for  each  steering  law  we  can  write  Eq.  (11.3-1)  as 


±(t)  =  A.(t)  x(t)  ;  i  =  1,  2,  3 


A.  =  A-bh.(t)T 
l  — i 


0  10  0 

0  0  1-1 

0  0  0  0 

0  0  0  -a 


(11.3-4) 


Associated  with  any  steering  law  ac(t)  there  is  a  value  of 
the  performance  index  for  each  missile  trajectory,  determined  by 
the  particular  set  of  initial  conditions  and  any  state  variable  measure¬ 
ment  errors  that  enter  into  the  implementation  of  Eq.  (11.3-2).  We 
define 


Jf  =  XjfT)2  ; 


T 

Ju  -  (  ac(r)2(lr  (11.3-5) 

t 


where  Ju  is  referred  to  as  the  control  effort  expended  or  simply  the 
"effort."  Referring  to  Eq.  (11.2-17),  note  that 
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J  J,  +  rJ 
f  u 

where  the  lower  limit  of  integration  is  changed  from  0  to  t,  signifying  that 
guidance  can  begin  anytime  prior  to  the  intercept  timeT.  Remember  that  the 
optimal  value  of  *J  associated  with  Eq.(ll.  3-4)  is  achieved  only  when  aCj(t) 
is  used  as  the  steering  command.  Our  purpose  here  is  to  compute  x^(T) 
and  Jyfor  different  trajectories  using  all  three  steering  laws  defined  in 
Eqs.  (11.3-2)and(11.3-3)and  for  different  values  of  the  weighting  constant  r. 


11.3.1  Terminal  Miss  Sensitivity  to  Initial  Conditions 


Nonzero  initial  conditions  on  the  state  x  contribute  to  terminal 
miss  and  to  the  control  effort  expended;  if  x  were  ever  zero  and  the  feed¬ 
back  steering  command  were  perfectly  implemented,  ac(t)  would  remain 
identically  zero  along  the  trajectory  and  both  and  Ju  in  Eq.  (11.3-5) 
would  be  zero.  To  economically  determine  the  miss  x^(T)  for  a  wide 
variety  of  trajectories,  the  adjoint  techniques  described  in  Section  G.  1  are 
helpful.  There  it  is  proved  that  the  miss  associated  with  Eq.  (11. 3-4) 
caused  by  initial  conditions  x(t)  for  t  *  T  is  given  by 


xx(T)  =  tf>.(T,t)T  x(t) 


(11.3-6) 


where  the  sensitivity  function,  satisfies 


£.(T,t)  =  -  A.(t)T  g.(T,t) 


<2i(T,T)  = 


;  -!  *  i,  2,  s 


(11.3-7) 
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The  sensitivity  function  is  evaluated  by  integrating  Eq.  (11.3-7)  (generally 

using  a  numerical  method)  backward  in  time.  Once  <p^( T,t)  is  obtained  and 

tabulated  for  any  steering  law  aP  (t),  the  effects  of  initial  conditions  at 

i 

any  initial  time  on  terminal  miss  can  be  calculated. 

The  value  of  any  element  of  <p^(T,t)  is  the  miss  produced  at  time 
T  by  a  unit  initial  condition  at  time  t  on  the  corresponding  state  and  zero 
initial  conditions  on  all  the  other  state  variables.  For  example,  the  ter¬ 
minal  error  produced  by  a  unit  velocity  normal  to  the  line  of  sight 
(y(0)  =  1.0  ft/sec)  is  equal  to  <p.2(T,0)feet,  where  ^(T,!))  is  the  second 
element  of  cp.(T,0)  with  units  of  ft/ft/sec.  To  introduce  notation  more  sug¬ 
gestive  of  this  property  of  the  sensitivity  function,  we  denote  the  normalized 
miss  distance  associated  with  the  j*h  element  of  o.(T,t)  by  m.^(T,t)  where 
i  refers  to  the  particular  steering  law  ac.(t)  being  employed.  It  is  normal¬ 
ized  by  the  units  of  x(t),  as  indicated  by  the  over-bar  notation. 

Recall  that  the  equations  of  motion  for  the  optimal  guidance  prob¬ 
lem  are  linearized  about  the  initial  line-of-sight.  Therefore  we  can  regard 
the  initial  value  of  y  in  Fig.  11.2-4  (hence  also  of  x^  in  Eq.  (11.3-4))  as 
being  zero.  In  addition,  the  initial  value  of  the  autopilot  state  x^(t)  repre¬ 
sents  an  initial  lateral  acceleration  of  the  missile  at  launch.  If  one  exists, 
it  is  likely  to  be  small  and  have  relatively  little  effect  upon  terminal  miss; 
therefore  we  consider  x^(0)  =  0  also.  Consequently,  the  quantitative  prop¬ 
erties  of  m.^(T,t)  and  m.^(T,t)  will  not  be  displayed  here. 

At  this  point  no  sensitivity  function  has  been  defined  for  the  line- 
of-sight  rate  X(t)  in  ^qs.  (11.2-22),  (11.2-25),  and  (11.2-27).  If  this  state 
variable,  rather  than  y(t)  and  y(t),  is  to  be  used  in  implementing  the  steer¬ 
ing  i  -  are  more  interested  in  its  associated  normalized  miss  rather 
than  that  corresponding  to  an  initial  position  and  velocity  normal  to  the 
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line  of  sight.  To  obtain  this  quantity,  refer  to  Eq.  (11. 2-21)  from  which 
one  can  derive  y(t)  in  terms  of  X(t)  and  y(t), 

y(t)  "  ^(Vgo^'W)  (11.3-8) 

and  define  the  normalized  miss  associated  with  X(t)  to  be  m.,.(T,t). 

Because  Eqs.  (11.2-19)  and  (11.2-22)  are  approximately  equivalent  mech¬ 
anizations  of  the  optimal  steering  command  ac^(t),  it  follows  that  the  ter¬ 
minal  miss  m  caused  by  X(t),  y(t),  and  y(t)  satisfies 

m  =  m^OTjt)  y(t)  +  ml2(T,t)  y(t) 

m  =  m15(T,t)x(t)  (11.3-9) 

Substitute  for  y(t)  from  Eq.  (11.3-8)  into  the  right  hand  side  of  the  first 
expression  in  Eq.  (11.3-9)  aid  eliminate  the  variable  m  to  obtain 

£15<T,t)x(t)  =  £u(T,t)  (Vgo^-w)  ^(T.tjyft)  (11.3-10) 

This  relation  must  hold  for  arbitrary  values  of  y(t)  and  X(t);  consequently, 
equating  the  coefficients  of  X(t)  in  Eq.  (11. 3-10),  it  follows  that 

m15(T,t)  =  mn(T,t)  vc(T-t)2  (11.3-11) 

This  gives  us  an  expression  for  the  normalized  miss  caused  by  X(t)  in  terms 
of  that  caused  by  y(t)  for  optimal  steering  law  aCj(t).  However,  the 
same  relation  between  position,  velocity,  and  line*of-sight  rate  (Eq.  11.2-21)) 
is  used  for  all  the  steering  policies  under  investigation.  Therefore  Eq. 
(11.3-11)  holds  for  all  values  of  the  first  subscript  index; 
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mig(T,t)  =  v„  (T-t)' 


ll 


i  =  1,  2,  3 


(11.3-12) 


Graphs  of  the  functions  m..(T,t)  —  i  =  1,2,3;  j  =2,3,  5,  —  have 

^3 

been  computed  for  different  values  of  the  control  weighting  r  and  autopilot 
natural  frequency  a.  Curves  for  m.^  are  normalized  by  g  (one 
g  =  32.2  ft/sec^)  and  those  for  in.,,  are  normalized  by  v  .  Values  of  1  and 
10  are  chosen  for  a  to  represent  respectively  slow  and  rapid  autopilot 
responses.  The  results  are  plotted  in  Fig.  11.3-1  through  11.3-7.  The 
time  dependence  of  each  normalized  miss  is  in  terms  of  the  quantity  (T-t); 
therefore  the  abscissa  of  each  graph  represents  time-to-go  in  seconds. 
Recall  from  the  definition  of  m.^(T,t)  that  the  time  (T-t)  is  the  instant  at 
which  an  initial  condition  on  one  of  the  state  variables  is  assumed  to  exist; 
physically  this  can  represent  the  time-to-go  at  launch  or  a  point  at  which 
the  target  begins  a  constant  maneuver. 


Notice  that  the  sensitivity  curves  caused  by  initial  conditions  on 

velocity  (m_2  in  Fig.  11.3-2)  and  line-of-sight  rate  (m.^  in  Fig.  11.3-7) 

are  identical  for  both  of  the  suboptimal  steering  laws.  This  is  due  to  the 

fact  that  the  only  difference  between  ac  (t)  and  ac  (t)  is  the  term  dependent 

2  3 

upon  the  constant  target  acceleration  in  Eq.  (11.2-24),  which  is  zero  for 
all  time  along  the  trajectories  represented  by  these  figures. 


Figures  11. 3-l(a)  through  11. 3-7(a)  correspond  to  an  autopilot 
time  constaiu  of  1  second;  Figs.  11.3-l(b)  through  11.3-7(b)  present 
similar  infoimation  for  a  time  constant  of  0. 1  second.  The  principal  com¬ 
parison  one  can  make  from  these  curves  is  Lhat  miss  distances  for  the 
guidance  systems  having  the  more  rapid  autopilot  response  are  uniformly 
lower.  Of  course,  nothing  is  said  yet  about  the  relative  amounts  of  con¬ 
trol  effort  expended,  but  it  is  not  expected  that  this  contribution  to  the 
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Figure  11.3-1  Normalized  Miss  Distance  for  an  Initial  Cross 

Track  Velocity:  Optimal  Steering  Law  aci(t) 
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Figure  11.3-2  Normalized  Miss  Distance  for  an  Initial  Cross-Track 

Velocity:  Suboptimal  Steering  Laws  acJt)  and  ac3(t) 
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Figure  11.3-5 


Normalized  Miss  Distance  Caused  by  Target 

Acceleration:  Suboptimal  Steering  Law  ac  (t) 
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Figure  11.3-6 


Normalized  Miss  Distance  Caused  by  Initial  Line 
of-Sight  Rate:  Optimal  Steering  Law  ac  (t) 
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Figure  11. 3-7  Normalized  Miss  Distance  Caused  by  Initial 

Line-of-Sight  Rate:  Suboptimal  Steering 
Laws  ac  (t)  and  ac  (t) 


performance  index  in  Eq.  (11.2-17)  will  reverse  the  judgement  that  a  faster 
autopilot  is  preferable.  After  all,  given  a  system  with  no  autopilot  lag,  the 
associated  optimal  steering  law  a^Ct)  does  not  call  for  one  to  be  added. 
Also  it  is  clear  from  inspection  of  individual  figures  that  increasing  the 
weighting  on  the  control  effort  always  increases  the  terminal  miss  for  the 
optimal  steering  law,  and  almost  always  does  so  for  the  suboptimal  steer¬ 
ing  laws.  The  maximum  miss  results  when  initial  conditions  occur  at  a 
time-to-go  on  the  order  of  one  autopilot  time  constant.  Physically  the 
latter  is  true  because  enough  time  remains  for  the  initial  condition  to  cause 
appreciable  miss  if  no  guidance  action  is  taken  while  insufficient  time  re¬ 
mains  for  the  autopilot  to  closely  follow  corrective  steering  commands. 
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i 


Consequently,  it  is  advantageous,  from  the  target’s  point  of  view  (Figs. 
11.3-3  through  11.3-5),  for  the  target  to  begin  its  constant  acceleration 
maneuver  at  this  most  sensitive  time. 


Most  of  the  above  observations  are  qualitative  judgements  which 
are  readily  predictable  on  the  basis  of  what  is  already  known  about  tra¬ 
jectory  sensitivity,  e.g,,  see  Figs.  3. 1-1  through  3. 1-4  of  Section  3.  To 

establish  whether  the  optimal  steering  law  a„  (t)  has  significant  advantages 

L1 

over  the  other,  suboptimal  controls  we  must  determine  the  control  effort 
expended  for  representative  sets  of  initial  conditions.  This  is  the  subject 
of  the  next  section. 


11.3.2  Control  Effort  Sensitivity  to  Initial  Conditions 

The  contrbl  effort  Jtj  expended  by  the  guidance  system  to  null 
initial  conditions,  using  any  one  of  the  steering  laws  under  investigation, 
can  be  determined  by  the  method  described  in  Section  G.4.  Denote  the 
values  of  corresponding  to  ac(t)  =  ac-(t)  by  Ju., 

.  rT  ? 

Ju  =  \  ac(r)  dr;  i  =  M,3  (11.3-13) 

i  Jt  i 

where,  as  before,  t  denotes  the  time  at  which  the  guidance  problem  begins. 

With  application  of  Eq.  (G-31)  Ju  is  given  by 

i 


J  =  x(t)T  C  <t)  x(t); 

i  =  1,2,3 

i 

(11.3-14) 
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where  C.(t)  satisfies  the  linear  matrix  differential  equation 

C.(t)  =  -C.(t)Ai(t)-A1(t)TCi(t)-h.(t)hi(t)T 
C.(T)  =  0  1  =  1, 2,3  (11.3-15) 


with  A^t)  defined  by  Eq.  (11.3-4).  Equation  (11.3-15)  is  evaluated  by 
integrating  backwards  in  time  from  the  known  terminal  value  of  C^T). 

For  the  purpose  of  evaluating  separately  the  effort  caused  by 
initial  conditions  on  each  state  variable,  we  need  to  know  only  the  diagonal 
elements  of  C.(t)  to  perform  the  calculation  in  Eq.  (11. 3-14);  i.  e. , 


J  =  e  (t)x.(t)2  (11.3-16) 

lj  11  1 


where  x^(t)  is  the  initial  condition  on  the  jth  state  variable  at  time  t  with  all 
other  initial  conditions  zero,  c.^(t)  is  the  j**1  diagonal  element  of  Cj,  and 
Ju..  denotes  the  value  of  Ju.  produced  by  Xj(t).  Thus  the  control  effort 
expended  depends  upon  which  steering  law  is  in  use  and  the  magnitude  of 
the  nonzero  initial  condition.  To  facilitate  the  discussion  it  is  convenient 
to  define  a  root  normalized  control  effort,  Ju.  ,  according  to 


(11.3-17) 


For  any  particular  values  of  i  and  j  the  effort  is  given  by 


J  =  J  x. (t)  g j  (11.3-18) 

uij  v  uij  3  ' 
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Physically  (jUy)^  is  the  integral  of  the  square  of  commanded  normal 
acceleration  (in  g’s)  caused  by  a  unit  initial  condition.  Plots  of  the  func¬ 
tions  JUij(tgo)  —  i  =  1,2,3;  j  =  2, 3,  5,  —  corresponding  to  the  miss  sensi¬ 
tivity  curves  in  the  preceding  section  are  given  in  Figs.  11.3-8  through 
11.3-12. 


As  indicated  in  Figs.  11.3-8  and  11.3-9,  Ju-5  can  be  expressed 
in  terms  of  To  derive  this  relationship,  refer  to  Eq.  (11.2-21)  and 

assume  that  y(t)  =  0  at  the  initial  time  t;  therefore 

X(t)  »  JP-  (11.3-19) 

c  go 

Substitution  for  y(t)  from  Eq.  (11. 3-19)  into  Eq.  (11.3-16)  produces 


C12(t) 


.2 


govc  X(t) 


2 


Defining  the  root  normalized  effort  produced  by  the  initial  line-of-sight 
rate  as 


/.  v  .2  2/2 

c.0(t)  t  v  /g 
i2v  go  c '  & 


it  follows  from  Eq.  (11.3-17)  that 


(11.3-20) 


The  curves  corresponding  to  an  initial  cross -track  velocity 
(Figs.  11.3-8  and  11.3-9)  exhibit  increasing  maximum  values  as  the 
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Figure  11.3-8 


Root  Normalized  Effort  Caused  by  Initial  Cross - 
Track  Velocity  or  Initial  Line -of -Sight  Rate: 
Optimal  Steering  Law  ac^(t) 
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Figure  11.3-9 


Root  Normalized  Effort  Caused  by  Initial  Cross- 
Track  Velocity  or  Initial  Line-of-Sight  Rate: 
Suboptimal  Steering  Laws  a^t)  and  aCg(t) 
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Figure  11. 3-10  Root  Normalized  Effort  Caused  by  Target 

Acceleration:  Optimal  Steering  Law  ar<(t) 
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Figure  11.3-11 
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Root  Normalized  Effort  Caused  by  Target 
Acceleration:  Suboptimal  Steering  Law  aCo(t) 
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Figure  11.3-12 


Root  Normalized  Effort  Caused  by  Target 
Acceleration: for  Suboptimal  Steeriig  Law 


ac3(t) 


control  weight  r  is  decreased.  As  expected,  this  trend  is  opposite  to  that 

observed  for  the  normalized  terminal  miss  (Figs.  11.3-1)  and  11.3-2).  The 

sets  of  values  of  r  for  the  cases,  a  =  l  and  o  =  10,  are  different  simply  to 

scale  the  performance  index  so  that  each  case  exhibits  about  the  same 

range  of  variation  for  ^  each  figure  for  Ju^j  the  curves  converge 

toward  zero  as  t  increases  because  there  is  more  time  to  reduce  the 

go 

terminal  miss  produced  by  the  initial  cross -track  velocity  and  therefore 
less  effort  need  be  expended.  Similar  effects  are  observed  for  target 
acceleration  (Figs,  11.3-10  through  11.3-12),  except  that  the  curves  for 
Juig  do  not  converge  to  zero  with  increasing  t^. 


As  suggested  when  justifying  the  choice  of  performance  index  in 


Eq.  (11.2-17),  besides  energy  expenditure  the  value  of  Ju  provides  an 


M 
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indication  of  the  magnitude  of  the  airframe  normal  acceleration  and  con¬ 
trol  surface  deflection.  To  provide  an  estimate  of  the  latter  a  normalized 
root  mean  square  commanded  acceleration,  a..,  can  be  calculated  from 

*ij  "  Kj/Vo  (^’s/unit  state)  (11.3-21) 

If  it  happened  that  the  steering  command  were  constant  over  the  trajectory, 
it  would  have  the  value  a„.  Consequently  the  latter  provides  an  idea  of  the 
order  of  magnitude  of  the  command;  in  fact  it  is  a  lower  bound  on  its  maxi¬ 
mum  magnitude. 


To  illustrate  how  the  normalized  rms  steering  command  can  be 
used  to  estimate  the  demands  on  control  surface  deflection,  recall  that  the 
commanded  normal  acceleration  is  an  input  to  the  autopilot  loop,  as  indi¬ 
cated  in  Fig.  11. 3-13.  Throughout  the  above  discussion  we  have  modeled 
the  overall  transfer  function  T(s)  as  a  simple  first  order  lag  but  in  actual 
practice  it  consists  of  several  components  and  can  have  more  than  one 
dominant  pole.  The  control  surface  deflection  A(s)  is  related  to  commanded 

acceleration  A  (s)  by  the  transfer  function 
c  " 


A(s)  _ 

A^I 


Gc(s)  G^(s) 


4 


neglecting  the  sensor  dynamics.  If  A  (s)  is  known  the  control  surface 

c 

response  can  be  calculated.  In  particular,  if  we  assume  that 


a  (t)  =  a..x.(0) 

c  13  J 

for  a  control  law  ac.(t)  and  a  specified  guidance  initial  condition  x.(0), 
then 
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Figure  11.3-13  Autopilot  Block  Diagram 

a  x.(0)T.(s) 

A(8)  =  '-1  i  - 

In  this  manner  one  can  obtain  some  idea  of  the  character  of  the  response, 
recognizing  that  the  assumption  of  a  constant  input  command  is  a  very 
rough  approximation.  Such  information  is  useful  for  estimating  whether 
the  control  surface  deflection  called  for  by  the  steering  command  exceeds 
the  bounds  imposed  by  mechanical  constraints. 

If  one  is  interested  in  knowing  the  response  of  important  autopilot 
variables  to  steering  commands  in  more  detail  than  outlined  above,  two  pos¬ 
sible  courses  of  action  can  be  taken.  First,  if  detailed  time  behavior  is  de¬ 
sired,  the  actual  autopilot  equations  of  motion  can  be  simulated.  Alterna¬ 
tively,,  if  the  integral  square  values  of  various  quantities  are  desired,  they 
can  be  obtained  using  the  adjoint  techniques  of  Appendix  G;  this  procedure 
is  briefly  outlined  below  for  the  application  illustrated  in  Fig.  11.3-13. 

The  problem  is  to  determine  the  integral  square  values  of  auto¬ 
pilot  state  or  output  variables  for  the  steering  commands  a^t)  given  in 
Eq.  (11.  3-2).  These  quantities  can  be  determined  in  a  manner  similar  to 
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that  used  for  the  performance  index,  J^.  In  particular,  let  the 
autopilot  equations  of  motion  be  written  as 


z(t)  =  Fz(t)+gac(t) 
a(t)  =  cT  z(t) 

y(t)  =  dTz(t)  (11.3-22) 

where  z(t)  is  the  state  of  the  autopilot,  a(t)  is  normal  acceleration,  and  y(t) 
is  any  variable  (e.g.,  normal  acceleration,  control  surface  deflection,  etc.) 
whose  integral  square  value,  J^,  is  desired, 

T 

J  =  (  y(r)2  dr 
y 


(Recall  that  in  Eq.  (11. 3-4)the  autopilot  dynamics  are  assumed  to  be  a 
first  order  lag. )  To  derive  the  actual  equations  of  motion  for  the  guidance 
system  with  the  autopilot  modeled  by  Eq.  (11.3-22),  replace  x4(t)  in  Eq. 
(11.3-4)  by  z(t)  and  combine  the  result  with  Eq.  (11. 3-22)  to  obtain 
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(11.3-23) 


11-41 


THE  ANALYTIC  SCIENCES  CORPORATION 


Equation  (11. 3-23)  represents  the  actual  dynamics  of  the  guidance  and  con¬ 
trol  system  with  steering  laws  specified  by  Eq.  (11.3-2).  Defining 


w(t)T  =  x^t)  x2(t)  x3(t)  z(t)T 


the  equations  of  motion  for  the  guidance  system  can  be  written  compactly  as 

w(t)  =  F.(t)w(t)  (11.3-24) 


and  J  as 

y 


(  w^T\r)  Qw(r)  dr 

Jt 


Q 


'[0]  [0]' 
[0]  ddT 


(11.3-25) 


Equations  (11.3-24)  and  (11. 3-25)  have  the  same  form  as  Eqs.  (G-l)  and 
(G-4);  consequently  Eq.  (G-30)  can  be  used  to  calculate  Jy  in  terms  of  an 
initial  condition  on  w(t), 

Jy  =  w(t)T  D(T,t)  w(t) 

D(T,t)  ^  -  F.(t)TD(T,t)  -  D(T,t)  F.(t)  -  Q  ;  D(T,T)  =  0  (11.3-26) 

The  solution  of  Eq.  (11.3-26)  for  D(T,t)  provides  a  means  for 
calculating  the  integral  square  value  of  any  autopilot  output  variable  defined 
by  d  in  Eq.  (11.3-22).  This  type  of  analysis  permits  a  more  realistic 
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evaluation  of  the  effect  of  a  given  steering  law  on  the  autopilot  than  does 
the  value  of  integral  square  commanded  acceleration. 


11.3.3  Steering  Law  Evaluation 

To  make  a  judgement  about  the  effectiveness  of  one  steering  law 
as  opposed  to  another,  we  need  to  compare  the  values  of  control  effort 
expended  to  produce  a  given  level  of  normalized  terminal  miss.  In  making 
such  a  comparison  the  time-to-go,  i.e.,  the  time  of  missile  launch  or  the 
time  a  target  maneuver  occurs,  must  be  considered.  One  procedure  for 
doing  this  is  to  compare  miss  distances  and  effort  expended  for  different 
control  weightings  r  at  values  of  t  which  are  related  to  the  autopilot  lag; 
e.  g.,  take  t  =  1/a,  2/a, . .  ,  n/a.  Figure  11  „  3-14(a)  is  a  graph  of  the 
normalized  effort  defined  in  the  preceding  section,  versus  the  magnitude 
of  the  normalized  miss  for  a  one  second  autopilot  time  constant,  using  both  op¬ 
timal  and  suboptimai  control  laws,  required  to  remove  an  initial  cross -track 
velocity.  The  data  for  these  curves  is  read  from  Figs.  11.3-l(a),  11.  3-2(«\., 
11.3-8(a)  and  11.3-9(a)  at  the  point  t  =  1.0.  The  significance  of  this  plot 
is  that  if  the  missile  is  launched  at  a  point  one  second  away  from  the  target 
with  a  unit  cross -track  relative  velocity,  then  the  curves  give  the  resulting 
terminal  miss  and  control  effort  expended  using  various  steering  laws.  The 
value  of  the  weighting  constant  r  required  to  calculate  each  steering  com¬ 
mand  is  read  as  a  parameter  along  the  individual  curves.  This  plot  also 
indicates  the  degree  of  improvement  obtained  from  the  optimal  steering  law 
relative  to  the  suboptimai  methods.  For  example,  if  one  can  tolerate  a 
normalized  terminal  miss  no  greater*  than  0.25,  the  superiority  of  the 
_ 

A  specification  on  normalized  terminal  miss  can  be  arrived  at  through 
knowledge  of  the  missile's  velocity,  likely  target  velocities,  and  likely 
launch  heading  angles.  An  illustration  of  how  this  can  be  done  is  given 
below  in  Example  11.3-1. 
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optimal  steering  law  in  terms  of  control  effort  expended  is  measured  by  the 

difference  Ju  (.25)  -JUl0(.25)  in  Fig.  11.3-14(a),  which  has  a  value  of 0.290. 
22  l" 

A  significant  observation  is  that  an  arbitrarily  small  miss- 
distance  cannot  be  achieved  using  either  suboptimal  steering  law.  Recall 
from  Eq.  (11. 2-28)  that  in  the  limit  of  zero  control  weighting,  ac3(t) 
becomes  proportional  steering  with  p=  3.  The  normalized  miss  achieved 
with  the  latter  for  a  unit  initial  cross  track  relative  velocity  can  be  read 
from  Fig.  3.1-4,  using  the  fact  that 


X(0) 


=  m 

r 

go 


It  provides  a  lower  bound  for  the  normalized  misses  and  m^  obtained 
with  the  suboptimal  steering  laws,  and  it  is  achieved  only  in  the  limit  as 
r  approaches  0.  With  t  =  1  sec  and  a  =  1.0,  the  bound  has  the  value  0. 18 
as  indicated  in  Fig.  11.3-14(a). 


If  no  steering  at  all  is  used  (ac(t)  =  0),  the  normalized  terminal 
miss  is  numerically  equal  to  tgQ.*  Consequently  the  curves  in  Fig. 
11.3-14(a)  all  terminate  on  the  abscissa  at  m12  =  1.0.  An  appreciable 
advantage  is  gained  from  optimal  steering  only  when  the  desired  normal¬ 
ized  miss  is  significantly  smaller  than  the  "no-steering"  value. 


In  a  given  application,  graphs  like  Fig.  11.3-14(a)  should  be 

derived  for  several  possible  values  of  t  at  which  the  missile  might  be 

launched.  As  t  increases  relative  to  the  autopilot  time  constant,  the 
go 

absolute  values  of  control  effort  required  to  achieve  a  given  miss  distance 
become  smaller,  as  illustrated  by  Fig.  11 . 3-14(b)  for  t^  =  4.0.  This 


* 

li  ac(t)  =  n,  the  normalized  miss  is  equal  to  the  product 
(1  ft/ sec)  (tgp). 
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behavior  is  simply  evidence  of  the  fact  that  the  longer  the  time  until  inter¬ 
cept,  the  more  easily  the  guidance  system  can  reduce  the  effects  of  initial 
condition  errors. 

Diagrams  similar  to  Figs.  11.3-14(a)  and  (b)  are  displayed  for 
the  faster  autopilot  having  a  natural  frequency  of  10  rad/sec  in  Figs. 

11.3- 14(c)  and  (d).  The  data  for  the  curves  are  obtained  from  Figs. 

11.3- l(b),  11.3-2(b),  11.3-8(b),  and  11.  3-9(b).  As  one  expects,  the  mag¬ 
nitudes  of  the  effort  and  normalized  miss  are  generally  less  than  when 

a  =  1.0. 


In  addition  to  looking  at  the  various  terms  contributing  to  the 
performance  index  in  Eq.  (11.2-17)  one  should  insure  that  the  absolute 
control  magnitude  is  not  excessive.  This  can  be  done  exactly  only  by 
plotting  the  control  as  a  function  of  time.  However,  an  indication  of  con¬ 
trol  level  can  be  obtained  from  the  normalized  root  mean  square  com¬ 
manded  acceleration  level  defined  in  Eq.  (11.3-21).  Values  of  this  quantity 
are  simply  obtained  by  scaling  the  ordinate  in  Fig.  11. 3-14  by  the  factor 

(t  r1^2 

"go'  • 

To  summarize  the  evaluation  of  the  guidance  system  response 
to  relative  cross-track  velocity  initial  conditions,  we  shall  illustrate  the 
use  of  the  graphs  in  Fig.  11.3-14  with  an  example: 


Example  11.3-1  —  Consider  the  situation  in  Fig.  3.1-5  when  the 
target  has  a  velocity  of  1000  feet/second.  The  relative  cross-track  velocity 
at  launch  is  about  700  feet/second.  To  achieve  a  miss  of  seven  feet,  a 
normalized  miss  of  about  0.01  is  required.  To  see  whether  the  various 
steering  laws  can  achieve  this  level  of  accuracy,  values  of  root  normalized 
effort  and  normalized  rms  commanded  acceleration  for  the  various  cases 
represented  in  Fig.  11.3-14  are  tabulated  in  Table  11.3-1.  The  symbol  * 
denotes  the  fact  that  the  desired  miss  is  not  achievable.  When  the  cross - 
track  velocity  is  700  feet  per  second,  the  rms  commanded  acceleration 
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TABLE  11.3-1 


STEERING  LAW  PERFORMANCE  REQUIRED  TO  ACHIEVE 
A  NORMALIZED  TERMINAL  MISS  OF  .01  FOR  A 
CROSS- TRACK  VELOCITY  INITIAL  CONDITION 


Optimal  Steering 

Root  Normalized 
Effort,  J 

ui2 

Normalized  rms 
commanded 

acceleration,  2 

(g's/ft/sec) 

Q 

II 

H- * 

JD 

t  =  1.0 

go 

0.18 

0.18 

a  =  1.0, 

t  =  4.0 
go 

0.04 

0.02 

a  =  10, 

t  =0.1. 
go 

0.525 

1.66 

a  =  10, 

t  =0.6 
go 

0.10 

0.13 

Suboptimal  Steering 

J  ,  J 
u22  u32 

a22’  a32 

a  =  1.0, 

t  =  1.0 
go 

★ 

* 

a  =  1.0, 

t  =4.0 
go 

* 

* 

o  =  10, 

t  =0.1 
go 

* 

* 

a  =  10, 

t  =  0.6 
go 

0.112 

0.142 

% 

denotes  "not  achievable". 


required  is  given  by  product  (a^  x  700).  In  all  cases  except  one  (optimal 
steering,  a  =  1.0,  tgo  =  4,0)  this  quantity  is  greater  than  90  g’s.  This 
information  together  with  an  autopilot  analysis  of  the  sort  suggested  at 
the  end  of  Section  11.3.2  enables  one  to  make  a  decision  about  which  steer¬ 
ing  law  to  use  when  a  range  of  possible  launch  conditions  is  known. 
Furthermore  one  can  determine  whether  a  given  specification  is  reason¬ 
able;  it  may  be  physically  impossible  to  destroy  some  targets  with  a  given 
autopilot. 

The  effects  of  initial  cross-track  velocity  are  most  relevant  to  dogfight 
applications  where  the  pilot  of  a  launch  aircraft  may  be  cluse  to  the  target. 


11-47 


An  analysis,  such  as  that  presented  in  the  above  example,  aids  one  in 
obtaining  the  range  of  launch  conditions,  i.e.,  the  "launch  envelope, "within 
which  the  launch  aircraft  must  remain  to  make  a  successful  attack. 

We  emphasize  that  throughout  this  chapter  perfect  knowledge  of 
state  variables  in  mechanizing  the  steering  laws  is  assumed.  In  practice, 
appreciable  measurement  noise  usually  exists  in  the  steering  loop  which 
should  be  included  in  a  complete  performance  evaluation.  The  deter¬ 
ministic  system  analysis  chiefly  provides  qualitative  information  about  the 
ultimate  performance  that  can  be  expected  from  a  steering  law. 

Turning  now  to  the  effect  of  target  maneuvers,  a  different  cri¬ 
terion  from  that  used  above  is  suggested  for  comparing  normalized  miss 
and  normalized  effort  associated  with  different  steering  laws.  For  a  given 
value  of  the  weighting  constant  r  we  choose  that  value  of  t  which  maxi¬ 
mizes  the  terminal  miss.  Thus  each  steering  law  is  being  evaluated  at 
those  points  where  sudden  application  of  a  target  maneuver  has  the  maxi¬ 
mum  adverse  effect  upon  guidance  accuracy.  These  curves  are  relevant 
for  any  tactical  situation  because  the  target  maneuver  can  begin  anytime 
after  the  missile  is  launched. 

Referring  to  Figs.  11.3-3(a)  and  11.3  10(a)  which  illustrate  the 
effects  of  a  target  maneuver,  we  want  to  tabulate  or  graph  pairs  of  values 
of  the  maximum  and  its  associated  normalized  effort.  To  illustiate 
the  procedure,  in  Fig,  11.3-3(a)  note  that  for  r  =  0.2,  m13  is  0.9  at 
tg0  =  0. 3.  Entering  the  graph  in  Fig.  11. 3-10(a)  at  t  =  0.3,  read  out 
Ju  =  1.6.  This  "point",  (m13,  JUl3)  =  (0.9,  1.6),  and  the  two  others 
that  can  be  obtained  from  these  figures  for  r  =  5.0  and  r  =  50  are  plotted 
and  joined  by  a  smooth  curve  labeled  i  =  1  in  Fig.  11.3-15(a).  A  similar 
plot  is  made  in  the  same  figure  for  the  suboptimal  control  laws  aC2(t)  and 
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t-lCS! 


Figure  11. 3-15  Root  Normalized  Effort  Versus  Peak  Normalized 

Miss  Produced  by  Target  Acceleration 


a^t)  from  the  data  in  Figs.  11.3-4(a),  11.3-ll(a),  11.3-5(a),  and 
11.3-12(a).  The  process  is  repeated  in  Fig.  11. 3-15(b)  for  the  case  when 
the  autopilot  time  constant  is  0. 1  sec. 

in  each  of  these  grapns,  the  performance  of  the  suboptimal 
steering  law  aCg(t)  approaches  that  of  proportional  guidance  as  the  control 
weighting  r  approaches  zero.  Evidently  the  greatest  relative  improve¬ 
ment  to  guidance  system  performance  is  achieved  by  using  aC2(t)  which 
includes  compensation  for  the  target  maneuver.  The  additional  advantage 
achieved  by  using  the  optimal  steering  law  aCi(t)  is  proportionately  not  so 
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large  until  relatively  small  values  of  normalized  miss  are  desired.  The 
following  example  illustrates  the  use  of  Fig.  11.3-15. 


Example  11. 3-2  —  Suppose  that  one  expects  to  encounter  targets 
which  can  develop  a  normal  acceleration  up  to  10  g’s  and  insists  upon  a 
steering  law  that  yields  a  peak  terminal  miss  of  5  feet.  The  resulting 
required  peak  normalized  miss  in  feet/g  is  0. 5.  The  corresponding  values 
of  normalized  effort  and  normalized  rms  control  level  are  read  from  Fig. 
11.3-15  and  tabulated  in  Table  11. 3-2.  For  the  slower  autopilot  the 
optimal  steering  law  is  significantly  superior;  it  requires  an  rms  com¬ 
manded  acceleration  of  34  g's  with  a  target  maneuver  of  10  g’s.  For  the 
faster  autopilot,  there  is  little  performance  improvement  in  using  ac^(t) 
instead  of  aC2(t).  However,  aC2(t)  is  distinctly  preferable  to  ac3(t). 


TABLE  11.3-2 

STEERING  LAW  PERFORMANCE  REQUIRED  TO  ACHIEVE  A 
PEAK  NORMALIZED  TERMINAL  MISS  OF  0. 5  IN  THE 
PRESENCE  OF  TARGET  ACCELERATION 


Optimal  Steering 

Root  Normalized 
Effort,  J 

u13 

Normalized  rms 
:ommanded_ 
acceleration,  a-io 
(g’s/g) 

a  =  1,0 

1.7 

3.4 

Or  =  10.0 

0.4 

0.8 

Suboptimal  Steering  (a^t)) 

J 

u23 

a23 

o  -  1.0 

very  high;  >20 

very  high;  >20 

a  =  10.0 

0.44 

0.88 

Suboptimal  Steering  (a^ft )j 

J 

u33 

a33 

cr  =  1.0 

* 

* 

CT  =  10.0 

1.44 

1.2 

* 

denotes  "not  achievable". 
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11.3.4  Terminal  Miss  Sensitivity  to  Measurement  Bias  Errors 

The  effects  of  measurement  bias  errors  upon  terminal  miss 
distance  can  also  be  determined  using  adjoint  techniques,  as  described  in 
Section  G-3.  If  a  constant  error  £  is  made  in  the  measurement  of  the 
state  x(t)  beginning  at  time  t,  the  resulting  terminal  miss  Xj(T)  is  given  by 

Xj(T)  =  i£i(T,t)T1;  i  =  1,2,3 

where  the  subscript  i  still  refers  to  the  steering  law  being  used.  The 
differential  equation  for  */).(T, t)  is  obtained  by  comparing  Eq.  (G-18)  with 
Eqs.  (11.3-1)  and  (11.3-2), 

ij(T,t)  =  h.(t)b%.(T,t);  4>.(T.T)  =  0 

where  h.(t)  is  defined  in  Eq.  (11.3-3)  and  ^(T,t)  is  defined  by  Eq.  (11.3-7). 

The  value  of  any  element  of  ^(T,t)  at  time  t  is  the  miss  produced 

by  a  unit  bias  error  in  the  corresponding  measurement  of  the  state  and  zero 

error  in  all  other  state  measurements.  All  state  initial  conditions  at  time 

t  are  also  assumed  to  be  zero.  In  other  words,  the  jth  element  of  ^(Tjt) 

is  the  normalized  miss  b..(T,t)  produced  by  a  single  bias  error  in  the 

'J 

measurement  of  state  x.. 

J 

Values  of  the  normalized  miss  versus  time-to-go  for  a  target 
acceleration  measurement  error  are  plotted  in  Figs.  11.3-16  and  11.3-17 
for  the  same  values  of  autopilot  time  constant  and  control  weighting  used 
in  the  preceding  graphs.  Qualitatively  it  is  seen  that  the  miss  becomes 
smaller  as  the  control  weighting  r  decreases.  This  is  true  because  the 
decrease  in  r  increases  the  feedback  gains  in  the  steering  law,  causing 
more  rapid  guidance  system  response  to  the  error  in  iine-of-sight  rate 
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Figure  11.3-16 
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Normalized  Miss  Distance  Caused  by  Target 
Acceleration  Measurement  Bias  Error: 
Optimal  Steering  Law,  ari(t) 
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( b)  c  •  10.0  »•{■' 

Normalized  Miss  Distance  Caused  by  Target 
Acceleration  Measurement  Bias  Error: 
Suboptimal  Steering  Law,  aCo(t) 
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induced  by  the  bias  error,  thus  improving  terminal  accuracy.  An  increase 
in  or  also  improves  accuracy  because  the  autopilot  lag  decreases. 


One  useful  comparison  of  the  effects  of  bias  errors  on  the 
accuracy  of  the  steering  laws  is  obtained  by  plotting  the  peak  normalized 
miss  from  Figs.  11.3-16  and  11.3-17*  versus  peak  normalized  miss 
caused  by  constant  target  acceleration  in  Figs.  11.3-3  and  11.3-4  for 
various  values  of  the  weighting  r.  The  cases,  a  =  1  and  a  =  10,  are  dis¬ 
played  separately  in  Figs.  11.3-18(a)  and  (b).  Based  on  the  comparison 
between  the  curves  for  these  values  of  o  there  is  little  difference  in  sen¬ 
sitivity  to  bias  measurement  errors  between  optimal  and  suboptimal  steer¬ 
ing  laws.  In  the  case  a  =  10  the  curves  are  nearly  superimposed  on  one 

another.  This  is  to  be  expected  because  the  optimal  steering  law  a„  (t) 

1 

approaches  the  suboptimal  law  a^t)  as  the  autopilot  time  constant  gets 
smaller. 


An  alternative  comparison  is  afforded  between  the  steering  laws 
by  the  steady  state  values  of  b.^  incurred  as  time-to-go  becomes  large. 
Figures  11.3-13  and  11.3-17  indicate  that  the  optimal  steering  law  is  more 
sensitive  to  measurement  bias  errors  in  terms  of  the  steady  state  values 
of  normalized  terminal  miss.  This  fact  can  have  design  implications  for 
missiles  which  are  launched  at  a  time-to-go  equal  to  several  autopilot 
time  constants.  Because  ac^(t)  has  no  built-in  criterion  for  insuring  an 
optimal  response  in  the  presence  of  bias  errors,  theie  is  no  reason  to 
expect  it  to  yield  the  best  performance  when  such  errors  occur. 


In  Fig.  11.3-16  the  peak  normalized  miss  is  taken  to  fcj  the 
extrapolated  steady  state  values  of  the  curves. 
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Figure  11.3-18  Peak  Normalized  Miss,  Produced  by  Target 

Acceleration  Measurement  Bias  Error,  Versus 
Peak  Normalized  Miss  Caused  by  Target 
Acceleration 


11.3.  5  Summary 

This  concludes  our  sensitivity  analysis  of  the  optimal  and  sub- 
optimal  steering  laws  derived  in  Section  11.2.  Its  main  purpose  is  to  in¬ 
dicate  an  approach  to  evaluating  different  control  policies  in  terms  of  the 
design  objectives  of  an  optimal  control  problem.  Graphs  of  normalized 
miss  and  control  effort  of  the  type  depicted  in  Figs.  11.3-14,  11.3-15  and 
11.3-18  are  the  principal  tools  for  making  the  performance  .analysis. 
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These  curves  can  be  used  to  determine  actual  values  of  miss  distance  and 
control  effort  if  multiplied  by  an  assumed  value  of  the  corresponding  initial 
error  --  e.g.,  cross-track  velocity,  target  acceleration  or  bias  error. 
Beyond  this  quantitative  comparison  of  the  specific  steering  laws, 
some  general  qualitative  conclusions  are: 


•  The  faster  the  autopilot  response,  the  smaller  the 
absolute  gain  in  terminal  accuracy  using  optimal 
steering,  ac^(t),  as  compared  with  aC2(t). 

•  In  the  presence  of  target  maneuvers,  suboptimal 
steering  ac«(t)  offers  a  significant  improvement 
over  ac3(t)  oe cause  the  latter  has  no  provision 
for  measuring  target  acceleration. 

•  In  the  presence  of  measurement  bias  errors  the 
optimal  steering  law  can  exhibit  worse  perform¬ 
ance  than  a  suboptimal  policy,  depending  upon 
the  criteria  used  for  evaluation.  In  particular, 
the  terminal  miss  caused  by  a  bias  error  for  a 
missile  launched  at  a  time-to-go  equal  to  several 
autopilot  time  constants  away  from  the  target  is 
larger  when  optimal  steering  is  used. 


For  a  given  autopilot  lag,  a  definitive  judgement  about  which 
steering  law  to  use  can  be  made  only  after  considering  a  spectrum  of  pos¬ 
sible  initial  values  of  time-to-go  which  are  characteristic  of  the  particu¬ 
lar  application.  For  a  dogfight  mission,  sets  of  curves  of  the  type  shown 
in  Figs.  11.3-14  and  11.3-18  should  be  obtained  for  values  of  t  ranging 
over  those  values  expecied  at  launch.  In  missions  where  launch  takes 
place  at  relatively  long  ranges  from  the  target,  initial  cross-track  velocity 
(Fig.  11.3-14)  is  not  so  important  but  the  steady  state  effects  of  measure¬ 
ment  errors  (e.g.,Fig.  11.3-18)  must  be  evaluated.  In  addition  to  the  bias 
errors  treated  above,  the  effects  of  continuously  varying  random  measure¬ 
ment  errors  ("noise")  should  be  analyzed  as  outlined  in  Section  G.3. 
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Similarly,  graphs  like  those  in  Fig.  11.3-15  can  be  obtained  for  a  set  of 
values  of  t  over  the  range  in  which  target  maneuvers  are  expected  to 
occur,  usually  within  a  few  seconds  before  intercept.  Performance  data 
displayed  in  this  fashion  yield  enough  quantitative  information  to  permit  a 
rational  choice  of  steering  law. 

The  performance  criteria  throughout  the  above  discussion  are 
integral  square  control  and  the  squared  terminal  miss.  The  former  is 
somewhat  inappropriate  because  excessive  peak  control  magnitudes  can  be 
required.  The  normalized  root  mean  square  acceleration  defined  in  Eq. 
(11.3-21)  gives  an  optimistic  bound  on  the  commanded  acceleration  level. 
For  any  application  it  should  be  checked  by  actually  computing  the  steer¬ 
ing  command  as  a  function  of  time  for  representative  cases. 


11.4  A  COUPLED  GUIDANCE -AUTOPILOT  STEERING  LAW 

The  statement  is  made  in  Section  2. 1  that  the  steering  and  auto¬ 
pilot  control  loops  for  a  tactical  missile  are  usually  treated  as  separate 
entities  because  their  associated  response  times  are  considerably  different. 
This  is  a  valid  procedure  when  time -to- go  until  intercept  is  large  with 
respect  to  the  autopilot  response  time;  however,  when  the  latter  condition 
does  not  hold,  performance  can  be  improved  by  considering  guidance  and 
control  as  coupled  functions. 

The  effect  of  including  autopilot  dynamics  (assumed  to  be  a  first 
order  lag)  in  designing  an  optimal  steering  law  is  demonstrated  in 
Sections  11.2  and  11.3.  Only  a  partially  coupled  design  is  considered 
there  in  that  the  autopilot  characteristics  are  taken  into  account  by  the 
guidance  law  but  the  guidance  dynamics  are  not  taken  into  account  when 
designing  the  autopilot.  It  is  shown  that  an  optimal  steering  law  which 
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accounts  for  the  autopilot  lag  is  significantly  better  than  a  suboptimal  law 
that  neglects  autopilot  dynamics  if  the  initial  time-to-go*  is  of  the  same 
order  of  magnitude  as  the  autopilot  time  constant.  In  this  section  we 
determine  what  further  performance  benefits  can  be  gained  by  treating  the 
autopilot  and  steering  law  design  tasks  as  completely  coupled  problems. 


11.4.1  Problem  Formulation 


hi  this  section  we  formulate  the  optimal  guidance-control  prob¬ 
lem  in  a  manner  similar  to  that  used  in  Section  11.2.2.  Both  optimal  and 
suboptimal  control  laws  are  derived  for  comparison  purposes. 

Our  first  task  is  to  develop  a  set  of  state  equations  which  des¬ 
cribe  the  dynamics  of  the  overall  system.  The  missile  airframe  dynamics 
are  taken  to  be  second  order,  as  in  Eqs.  (9.2-9)  and  (9.2-10)  (the  actuator 
dynamics  are  neglected).  To  obtain  the  equations  of  motion  it  is  conven¬ 
ient  to  define  a  new  state  variable 


a(t) '  =  VLao(t)  (11.4-1) 

Combining  Eqs.  (9.2-9),  (9.2-10)  and  (11.4-1)  with  the  missile  transla¬ 
tional  dynamics  from  Eq.  (11.2-16)  produces 


3|c 

The  initial  time-to-go  is  regarded  as  the  point  when  the  target  begins 
a  constant  maneuver  or,  in  the  case  of  an  initial  cross-track  velocity, 
the  time  at  which  the  missile  is  launched. 
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z(t)  =  Fz(t)+g6(t) 

z(t)T  =  [y(t)  y(t)  at(t)  q(t)  a(t)']  =  [z^t)  z2(t)  z3(t)  z4(t)  zg(t)] 


'0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

-1 
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0 
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m6 

0 
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0 

VL 

<y 

-  La 

- 

;VL6L« 

Thus  Eq.  (11. 2-16)  is  altered  by  replacing  x4(t)  with  two  airframe  state 
variables  and  the  control  variable  becomes  control  surface  deflection,  6(t), 
instead  of  commanded  acceleration.  By  analogy  with  Eq.  (11.2-17)  the 
design  objective  is  to  choose  a  feedback  control  law  that  minimizes  the 
quadratic  performance  index 


(11.4-3) 


As  before,  z^(T)  is  the  terminal  miss.  However  the  integral 
expression  in  Eq.  (11.4-3)  is  not  stated  in  terms  of  commanded  accelera¬ 
tion;  instead  it  is  the  integral  of  the  square  of  the  control  surface  deflec¬ 
tion.  Consequently  the  contribution  of  the  control  effort  to  the  perform¬ 
ance  index  has  a  different  physical  significance  here  than  it  does  in  Eq. 
(11.2-17).  If  the  control  surface  actuator  is  driven  electromagnetically, 
the  effort  in  Eq,  (11.4-3)  is  proportional  to  the  energy  consumed.  If  the 

actuators  are  hydraulically  driven,  the  quantity 

T 

f  6(t)2  dt 

JQ 
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is  a  better  measure  of  energy  expenditure;  however,  it  is  usually  observed 
that  increasing  r  in  Eq.  (11.4-3)  also  tends  to  reduce  the  value  of  the 

Vs 

integral -square  control  surface  rate.  Therefore  the  above  expression  for 
J  has  useful  properties  for  most  actuator  systems.  With  the  above  qualifi¬ 
cations  in  mind,  we  shall  speak  of  the  control  penalty  term  in  Eq.  (11.4-3) 
as  a  measure  of  actuation  energy. 

Because  the  above  optimal  control  problem  has  the  same  struc¬ 
ture  as  that  discussed  in  Section  11.2,  analytical  solutions  for  the  optimal, 
time-varying  feedback  gains  can  be  obtained  using  the  procedures  des¬ 
cribed  in  Refs.  7  and  136.  However,  we  are  primarily  interested  in 
evaluating  the  performance  of  the  guidance  system,  a  task  that  must  be 
accomplished  numerically.  Closed  form  expressions  for  the  feedback 
gains  offer  no  particular  advantage  for  this  purpose;  therefore  they  are 
not  presented  here.  From  the  known  properties  of  the  optimal  regulator 
problem  (see  Appendix  B)  the  solution  for  control  surface  deflection  that 
minimizes  J  in  Eq,  (11.4-3)  is  given  by 


«X<t>  -  '•k1(t)Tz(t) 

kl(t)T  -  j-  gTS(t) 
c 


where  the  matrix  S(t)  is  the  solution  to 


S(t)  =  -  S(t)  F  -  FTS(t)  +  ±  S(t)  ggT  S(t) 

C 


S(T)  = 


1  0 
0  0 


0 

0 


(11.4-4) 


(11.4-5) 


0 


0 
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It  is  desirable  to  compare  the  optimal  control  law  given  above 
with  the  steering  law,  ac^(t),  derived  in  Section  11.2  under  the  assumption 
that  the  autopilot  is  already  designed  with  dynamics  given  approximately  by 
a  first  order  lag.  Recall  that  the  latter  is  an  optimal  steering  policy  for 
minimizing  J  in  Eq.  (11.2-17);  however  it  is  not  optimal  for  the  problem 
posed  in  this  section.  Because  ac^(t)  is  a  commanded  acceleration  and  6(t) 
is  a  control  surface  deflection,  some  algebraic  manipulation  is  required  to 
obtain  the  associated  suboptimal  control  62(t).  This  is  accomplished  by 
defining  the  suboptimal  control  law  according  to 

62(t)  =  -  Cj q(t)  -  c2a(t)'  +  cgac  (t)  (11.4-6) 

The  fixed  gains  cj  and  C2  are  chosen  to  provide  a  stabilized  airframe  hav¬ 
ing  specified  response  characteristics  and  c3  is  selected  so  that  the  d-c 
gain  between  ac^(t)  and  the  normal  acceleration  of  the  airframe,  a(t),  is 
unity.  The  implementation  of  Eq.  (11.4-6)  is  illustrated  in  Fig.  11.4-1. 
Equation  (11.4-6)  simulates  the  application  of  the  steering  law  aCj(t), 
derived  assuming  a  first  order  autopilot,  to  the  actual  second  order  auto¬ 
pilot. 

Recall  that  aCl(t)  is  derived  assuming  the  transfer  function  G(s) 
in  Fig.  11.4-1  is  of  the  form 

g 

s  +  g 

representing  a  first  order  lag.  The  value  of  g  is  required  for  mechanizing 
the  steering  law.  However,  in  the  case  treated  here  the  autopilot  is  actually 
second  order.  Consequently  the  first  order  lag  can  be  considered  only  as 
an  approximation.  With  the  fixed  gain  compensation  provided  by  c.  and  Cg 
in  Fig.  11. 4-1  the  autopilot  closed  loop  poles  are  obtained  by  substituting 
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Figure  11.4-1  A  Second  Order  Autopilot  Design 

for  a  Suboptimal  Steering  Law 


6g(t)  from  Eq.  (11.4-6)  for  6(t)  in  Eq.  (11.4-2)  and  considering  only  the 
dynamics  of  z4(t)  and  z^(t);  these  poles  are  the  eigenvalues  of  the  matrix 


M  -  M.c. 
q  6  1 

VLa(l+ClL6) 


M 


a 


VL  "  M6C2 
a 


-  L  +  VL  Lac„ 
a  a  o  2 


(11.4-7) 


We  define  -o  to  be  the  real  part  of  that  eigenvalue  of  Fa  which  has  the 
smallest  magnitude;  i.  e. , 


a  =  -  sign 


Re(Xj)  min  Re  (\.) 
J  i 


i  =  1,2 


(11.4-8) 


This  is  certainly  not  the  only  rational  choice  possible  for  a.  If  the  closed 
loop  poles  are  close  together  a  smaller  value  than  that  given  by  Eq.(11.4-8) 
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may  be  better.  However  it  seems  to  be  a  reasonable  selection  when  the 
poles  have  sizeable  imaginary  parts. 

To  express  the  suboptimal  control  6^(t)  in  terms  of  the  state 
variables  z(t)  defined  in  Ea  ^11 . 4-2),  a  relation  is  needed  for  the  air¬ 
frame  acceleration  a(t)  which  is  rocmired  to  generate  a„  (t).  From  the 
airframe  equations  of  motion  --  Eq.  (8. 1-3)  and  (11.4-1^  —  with  actuator 
dynamics  neglected,  it  can  be  shown  that 

a(t)  =  a(t) '  -r  VL5  6(t)  (11.4-9) 

Combining  Eqs.  (11.2-19),  (11.4-6),  and  (11.4-9),  6g(t)  can  be  written  as 


V't)  =  5.^) 


c3hl(t) 

c?h2(t) 

c3h3(t) 


ic2+c3h4(t) 


i+c3h4-(tyvr 


(11.4-10) 


where  the  gains  h.(t),  i  =  1,  2,  3,  4,  are  calculated  from  Eq.  (11.2-19). 

Equations  (11.4-4)  and  (11.4-10)  provide  respectively  optimal 
and  suboptimal  control  laws  for  the  system  in  Eq.  (11.4-2).  The  former 
is  a  result  of  considering  the  tasxs  of  autopilot  and  steering  law  design  as 
a  coupled  control  problem.  By  comparison  the  suboptimal  control  law 
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does  not  take  the  guidance  function  into  account  when  designing  the  autopilot, 
and  it  approximates  the  autopilot  second  order  dynamics  as  a  first  order 
lag  in  deriving  the  steering  law.  In  tne  next  section  the  relative  perform¬ 
ance  of  these  two  control  policies  is  compared  and  evaluated,  using  the 
same  sensitivity  analysis  techniques  applied  in  Section  11.3. 


11.4.2  Performance  Analysis 

By  analogy  with  Eq.  (11. 3-4),  the  closed  loop  form  of  Eq. 
(11.4-2)  is 


z(t)  =  F.(t)z(t) 

F.(t)  =  F-gk.(t)T  ;  1-1,2  (11.4-11) 

The  terminal  miss  z^T)  caused  by  a  nonzero  initial  value  of  z(t)  is  cal¬ 
culated  from  the  equations, 


ZjCr)  =  2.(T,t)Tz(t) 


Oj(T,t)  =  -F.(t)T£.(T,t) 


(11.4-12) 
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As  in  Section  11.3.1  we  define  m..(T,t)  to  be  the  normalized  miss  caused 

1L  __ 

by  a  unit  initial  value  for  the  j  n  element  of  z;  i.e.,  m^(T,t)  is  the  j**1 
element  of  <p^(T,  t).  Only  the  characteristics  of  the  sensitivity  to  cross¬ 
track  velocity,  m^C^t),  are  displayed  here  to  illustrate  the  type  of 
performance  achieved  with  this  optimal  guidance  law.  It  is  expected  that 
conclusions  for  this  case  will  also  hold  qualitatively  for  other  sensitivities 
of  interest. 


Figure  11.4-2(a)  shows  curves  of  the  normalized  miss  in^  for 
the  optimal  control  law  6j(t)  given  in  Eq.  (11.4-4).  Curves  for  m 22  cor¬ 
responding  to  the  suboptimal  control  law  in  Eq.  (11.4-10)  are  given  in 
Figs.  11. 4 -1(b)  and  (c)  for  two  different  sets  of  parameter  values  given  in 
Table  11.4-1  below.  The  first  set  (case  #1)  has  values  of  fixed  feedback 
gains  ci  and  C2  which  produce  a  relatively  slowly  responding  autopilot 
having  atime  constant  of  0.83  sec;  the  second  set  (case  #2)  yields  a  faster 
response  with  a  time  constant  of  0. 13  sec.  The  values  of  the  airframe 
parameters  --  M^,  M^,  M^,  L^,  Lfi,  and  V  --  are  taken  from  Appendix  H, 
Table  H-2,  flight  condition  4.  The  curves  for  m^  correspond  to  different 

values  of  the  performance  index  weighting  r  in  Eq.  (11.4-3)  and  those  for 

c 

nigg  to  values  of  the  weighting  r  in  Eq.  (11.2-17). 

Qualitatively,  the  curves  in  Fig.  11.4-2  are  much  the  same  as 
those  in  Figs.  11.3-1  and  11.3-2  with  respect  to  their  dependence  on  r  , 
r,  and  t  .  However,  the  oscillations  in  the  curves  corresponding  to  the 
suboptimal  control  law  (Figs.  11.4-2(b)  and  (c))  are  much  more  persistent 
than  we  have  encountered  previously.  This  behavior  results  from  the  fact 
that  6g(t)  is  derived  assuming  first  order  autopilot  dynamics;  however,  the 
actual  dynamics  prescribed  by  Eq,  (li. 4-7)  are  second  order  and  their 
associated  closed  loop  poles  have  significant  imaginary  parts.  The  situa¬ 
tion  is  quantitatively  pictured  in  Fig.  11.4-3  where  the  actual  autopilot 
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TABLE  11.4-1 

PARAMETER  VALUES  FOR  SUBOPTIMAL 
CONTROL  LAW  62(t) 


Suboptimal 
Control  Law 

Case  No. 

Parameter 

#1 

#2 

C1 

2.31  x  10“2 

0.198 

C2 

1.91  x  10”5 

1.42  x  10'3 

C3 

-1.61  x  10"4 

-1.65  x  10“3 

l/o(sec) 

0.83 

0.13 

J 

poles  and  the  assumed  first  order  pole,  -a,  are  displayed  for  both  sets 
of  control  parameters  in  Table  11.4-1.  Evidently  there  is  reason  to  expect 
that  the  first  order  lag  is  ;.:ot  always  a  good  approximation,  especially  for 
case  #1  where  the  autopilot  dynamics  are  more  nearly  those  of  a  harmonic 
oscillator.  The  significance  of  these  observations  is  thht  an  ’’optimal*’ 
guidance  law  based  on  an  inaccurate  description  of  autopilot  dynamics  may 
not  perform  nearly  so  well  as  expected  from  the  analysis. 

In  addition  to  terminal  accuracy,  another  quantity  of  interest  in 
evaluating  performance  is  the  control  effort 

T 

J  =  t  5. (t)2  dr  (rad2  sec)  (11.4-13) 

i  Jt 

which  is  a  measure  of  the  control  surface  deflection  required  for  controlling 
the  missile.  This  is  determined  by  a  set  of  expressions  similar  to  Eos 
(11.3-14)  and  (U.3-15);  i.e., 
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X;:  ACTUAL  AUTOPILOT  POLES 
FOR  i'-b  SUBOPTIMAL  CASE 

Oj:  AUTOPILOT  POLE  FOR  FIRST  ORDER 

APPROXIMATION  FOR  I  &  SUBOPTIMAL  CASE 

Figure  11.4-3  Closed  Loop  Pole  Locations  for  the  Second  Order 

Autopilot  and  its  First  Order  Approximation 


J  =  z(t)T  C(t)z(t) 

U.  1 


^(t)  =  -C.(t)F.(t)-Fi(t)TCi(t)-ki(t)k.(t)T 


C^T)  =  0  (11.4-14) 
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where  k.(t)  and  F.(t)  are  obtained  from  Eqs.  (11.4-4),  (11.4-10)  and 
(11.4-11).  As  in  preceeding  sections  we  define  a  root  normalized  effort 

J  =  [ c77  (rad-sec ^Vunit  state)  (11.4-15) 

U..  V  1]  '  ^ 

i]  J 

where  c..  is  the  j^1  diagonal  element  of  C.  so  that  the  effort  produced  by 

v 

an  initial  condition  z^(t)  is  calculated  from 


Note  that  the  units  of  Ju-  are  different  here  than  in  Eq.  (11.3-21)  be¬ 
cause  of  the  different  definitions  in  Eqs.  (11.3-13)  and  (11.4-13).  Plots 
of  JU12  corresponding  to  Fig.  11.4-2  are  given  in  Fig.  11.4-4. 

To  effectively  compare  the  optimal  and  suboptimal  control  laws, 

we  crossplot  values  of  mi2  and  Ju_2  from  Figs.  11.4-2  and  11.4-4  for  given 

values  of  time-to-go,  in  the  same  fashion  as  Fig.  11.8-14  is  derived.  The 

curves  are  given  in  Fig.  11.4-5  for  t  =  0.4  sec  and  1.0  sec.*  These 

go 

data  can  be  used  to  analyze  the  effects  of  various  initial  values  of  cross 
track  velocity  an  guidance  accuracy  and  control  effort,  as  is  done  in  Table 
11.3-1.  The  only  difference  is  that  the  effort  is  now  integral  square  control 
surface  deflection  rather  than  integral  square  commanded  acceleration. 


* 

Note  that  the  number  of  data  points  is  too  small  to  draw  a  smooth  con¬ 
necting  curve  in  some  cases.  This  is  because  the  oscillatory  nature  of  the 
curves  for  the  suboptimal  laws  in  F>.  11.4-2  causes  wide  fluctuations 
in  the  values  of  j  m^l  • 
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Figure  11.4-4  Root  Normalized  Effort  Caused  by 

an  Initial  Cross-Track  Velocity 
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Figure  11.4-5  Root  Normalized  Effort  Versus  Normalized  Terminal 

Miss  Caused  by  an  Initial  Cross -Track  Velocity 


Recall  that  there  are  two  aspects  of  the  suboptimal  law  6g(t)  that 
make  it  inferior  to  the  optimal  control  law  when  judged  an  the  basis  of  the 
index  J,  in  Eq.  (11.4-3).  First  it  is  derived  with  commanded  acceleration 
considered  as  the  control  variable  rather  than  control  surface  deflection 
and  the  autopilot  feedback  gains  are  preselected  constants  c^  and  C2« 
Second,  it  is  derived  assuming  the  autopilot  is  a  first  order  lag,  but  the 
actual  autopilot  dynamics  are  second  order.  Both  of  these  assumptions 
are  responsible  for  the  deviations  of  the  suboptimal  performance  curves 
from  those  for  the  optimal  control  law  in  Fig.  11.4-5;  the  reasons  for  this 
are  outlined  below. 

First  of  all,  the  fixed  feedback  gains  Cj  and  Cg  in  the  suboptimal 
design  tend  to  make  the  autopilot  response  faster  than  it  needs  to  be  when 
the  missile  is  some  distance  from  the  target.  Consequently  an  ir  i*ial 
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cross-track  velocity  causes  more  control  surface  deflection  (in  the  integral 
square  sense)  than  is  necessary  to  achieve  the  desired  terminal  accuracy. 
This  effect  is  most  significant  in  suboptimal  case  #2  and  contributes  to  the 
deviations  from  optimum  of  the  corresponding  performance  curves  in 
Fig.  11.4-5. 

In  the  second  place,  as  the  weighting  constant  r  decreases,  the 

feedback  gains  h^(t)  in  the  suboptimal  steering  law  6g(t) increase  and  the 

overall  system  response  becomes  more  sensitive  to  the  use  of  a  wrong 

(first  order)  model  for  the  autopilot.  This  effect  is  observed  through  the 

oscillatory  behavior  of  the  curves  in  Fig.  11.4-2,  particularly  case  #1 

where  it  has  already  been  noted  that  the  first  order  lag  approximation  is 

especially  poor.  The  resulting  contribution  to  terminal  miss  distance 

-4 

depends  upon  t  .  Thus  a  value  of  r  =  2.0  x  10  ,  which  gives  good  per¬ 

formance  att^  =  0.4  sec  in  Fig.  11.4-5,  gives  poor  performance  at 
t  =  1.0  sec,  relative  to  the  optimal  control.  If  the  weighting  constant  r 
gets  very  low,  the  suboptimal  guidance  system  actually  becomes  unstable 
because  the  denominator  of  the  factor 

1 

1  +c0h.(tp/L 
3  4W  a 

in  Eq.  (11.4-10)  becomes  negative  as  h^(t)  increases  in  magnitude,  making 
all  the  gains  in  kg(t)  have  the  wrong  sign.  The  conclusion  is  that  optimal 
control  theory  can  be  helpful  in  designing  laws  that  account  for  missile 
dynamics,  provided  these  dynamics  are  reasonably  accurately  modeled. 
Otherwise  the  true  system  performance  may  be  substantially  poorer  than 
that  prescribed  by  the  theory. 

Heretofore  nothing  has  been  said  about  the  behavior  of  the  feed¬ 
back  gains  in  optimal  guidance  systems.  It  has  been  possible  to  ascertain 
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their  effect  on  system  performance  using  adjoint  theory,  without  knowing 
their  specific  time  history.  However  the  optimal  gains  associated  with 
6^(t)  are  of  special  interest.  Figure  11.4-6(a)  is  a  plot  of  the  first  ele¬ 
ment  of  kj(t)  corresponding  to  the  simulation  data  in  Figs.  11.4-2(a)  and 
11.4-4(a).  For  comparison  purposes  the  first  element  of  the  set  of  gains 
kgft)  for  suboptimal  case  2  is  shown  in  Fig.  11.4-6(b).  Only  one  p  in  from 
each  set  is  required  because  they  all  behave  qualitatively  in  the  same 
fashion. 


The  point  of  interest  is  the  fact  that  the  optimal  gains  become 
slightly  negative  just  before  intercept;  this  happens  for  all  the  elements 
of  kj(t).  Consequently  the  guidance  system  is  driven  into  what  can  be 
considered  an  unstable  condition  for  a  short  period  of  time.  The  reason 
for  this  behavior  is  that  the  guidance  law  takes  advantage  of  the  fact  that 
the  particular  missile  represented  by  these  simulations  can  achieve  a 
limited  amount  of  lift  from  its  tail  control  surfaces  as  well  as  from  its 
fixed  wings.  Normally  the  principal  lift  force  is  supplied  by  the  wings; 
the  tail  controls  provide  the  pitching  moment  required  to  change  angle  of 
attack,  thereby  varying  the  magnitude  of  the  lift.  However  very  near 
intercept  there  is  insufficient  time  to  change  the  lift  force  on  the  wings, 
since  this  requires  rotating  the  entire  missile;  therefore  steering  com¬ 
mands  can  best  be  realized  using  the  faster  responding  control  surfaces. 

No  such  behavior  is  observed  for  the  suboptimal  gains  because  the  action 

* 

of  the  control  surface  is  not  included  in  the  model  dynamics.  Typically 
this  gain  reversal  is  small  in  magnitude  and  it  occurs  just  before  inter¬ 
cept;  consequently  it  has  little  net  effect  on  guidance  accuracy.  However, 

% 

Analytically  speaking,  the  behavior  of  the  optimal  gains  is  a 
reflection  of  the  fact  that  the  airframe  transfer  function  between 
control  surface  deflection  and  normal  acceleration  has  a  right- 
half-plane  zero. 
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Figure  11.4-6  Comparison  of  Representative  Time  Histories 

for  Optimal  and  Suboptimal  Gains 

it  suggests  that  if  the  vehicle  had  rapidly  responding  rotatable  wings 
mounted  near  the  center  of  gravity  in  addition  to  tail  control  surfaces,  then 
an  optimal  control  law  would  make  use  of  the  wing  controls  to  generate  the 
primary  lift  force, using  the  tail  controls  only  to  balance  the  resulting  lift- 
induced  moments.  That  is,  the  entire  missile  airframe  would  not  have  to 
rotate  to  develop  an  angle  of  attack;  instead  the  wings  would  rotate  with 
respect  to  the  missile  body,  presumably  resulting  in  more  rapid  control 
system  response  characteristics.  Such  a  design  seems  attractive  for  lifting 
vehicles  in  those  flight  regimes  where  tail  controls  alone  are  incapable  of 
providing  sufficiently  rapid  response.  Of  course  against  this  potential 

'  f 

advantage,,  itbe  weapons  designer  must  weigh  the  need  for  two  sets  of  con- 
trol  surfaces  and  must  consider  whether  the  resulting  configuration  has 
desirable  aerodynamic  properties. 
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Another  point  of  interest  about  the  set  of  optimal  feedback  gains 
(see  Fig.  11.4-6(a))  is  that  they  are  generally  increasing  as  time-to-go 
decreases  (except  very  close  to  the  end  of  the  trajectory).  Physically 
this  is  reasonable  behavior  because  rapid  system  response  near  the  end  of 
the  trajectory  is  required  to  achieve  a  low  value  of  terminal  miss.  The 
gains  decrease  at  the  end  because  the  lag  associated  with  the  guidance  sys¬ 
tem  dynamics  reduces  the  sensitivity  of  the  terminal  miss  distance  to  the 
control  level.  If  the  optimal  gains  associated  with  the  airframe  state 
variables  —  k^g(t)  and  k^(t)  in  Eq.  (11.4-4) —  are  considered  together 
with  the  airframe  dynamics  it  is  found  that  the  primary  effect  of  the  gains 
is  to  produce  a  time -varying  autopilot  whose  instantaneous  pole  locations 
move  to  the  left  (right)  in  the  complex  plane  as  kjg  and  k^  increase 
(decrease)  is  magnitude.  Consequently  ,  the  optimal  coupied  guidance - 
autopilot  steering  law  effectively  produces  an  autopilot  whose  response 
characteristic.-,  vary  along  the  trajectory.  The  advantage  of  this  over  the 
control  methods  discussed  in  Section  11. 2  is  that  less  control  energy  is 
consumed  along  the  trajectory.  However,  if  energy  consumption  is  not 
important,  good  terminal  accuracy  can  also  be  achieved  by  predesigning 
the  autopilot  to  have  as  good  response  characteristics  as  possible  every¬ 
where  along  the  trajectory  and  then  formulating  the  guidance  problem  with 
the  autopilot  dynamics  neglected,  as  in  deriving  a^O)  and  aCg(t)  in  Eqs. 
(11.2-24)  and  (11.2  -26).  The  conclusion  is  that  it  is  useful  to  include  air¬ 
frame  dynamics  in  the  formulation  of  the  guidance  problem  only  if  expen¬ 
diture  of  control  actuation  energy  is  an  important  consideration*;  other¬ 
wise  the  usual  convention  of  separating  the  design  of  the  autopilot  and 
guidance  loop  is  valid. 

The  purpose  of  this  section  has  been  to  demonstrate  the  effects 
on  performance  of  simultaneously  designing  the  autopilot  and  guidance 

See  the  discussion  immediately  following  Eq.  (11.4-3)  regarding  the 
significance  of  as  a  measure  of  control  energy. 
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loops  for  a  tactical  missile.  Three  important  conclusions  are  drawn  from 
the  abo-  3  discussion. 


•  If  autopilot  dynamics  are  included  in  the  problem 
formulation  for  an  optimal  steering  law,  they  must 
be  accurately  modeled;  otherwise  the  guidance 
accuracy  may  be  adversely  affected.  In  particular, 
the  assumption  that  the  autopilot  is  a  first  order  lag 
when  it  actually  is  second  order  with  lightly  damped 
complex  poles  is  a  relatively  poor  approximation. 

•  If  control  actuation  energy  expenditure  is  not  an 
important  consideration,  the  usual  convention  of 
separating  the  design  of  the  autopilot  and  guidance 
loops  is  valid. 

•  Guidance  loop  performance  may  be  significantly 
improved  if  a  faster  responding  control  technique, 
such  as  rotatable  wings  in  conjunction  with  tail 
controls  is  used. 

The  need  for  adaptive  control  or  identification  of  missile  airframe  dynamics 
is  implied  throughout  this  discussion.  If  the  autopilot  is  to  be  predesigned 
to  achieve  as  good  response  as  possible  everywhere  along  a  trajectory 
that  has  variable  flight  conditions,  then  adaptive  control  techniques  are 
needed.  Alternatively,  if  airframe  dynamics  are  to  be  incorporated  in  the 
guidance  problem  formulation,  as  in  £q.  (11.4-2),  then  the  parameters 
M^,  Mg,  etc.  must  be  identified.  These  tasks  can  be  accomplished  by  any 
of  the  methods  described  in  previous  chapters. 


11 . 5  SUMMARY  AND  CCNC  LUSIONS 

This  chapter  begins  with  a  review  of  classical  homing  guidance 
techniques  —  pursuit,  beam  rider,  and  proportional  guidance.  Proportional 
guidance  is  generally  considered  the  best  of  the  classical  methods  because 
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it  is  a  self  contained  system  that  is  theoretically  able  to  null  the  terminal 
miss  when  attacking  a  moving  target.  However  the  presence  of  target 
maneuvers,  autopilot  lag,  control  effort  constraints,  and  control  mag¬ 
nitude  limitations  suggests  that  steering  laws  designed  with  optimal  con¬ 
trol  techniques  may  offer  improvement  in  performance. 

Several  optimal  steering  laws  are  investigated  in  Sections  11.2, 
11.3,  and  11.4,  each  taking  into  account  a  number  of  different  effects. 
These  are  summarized  in  Table  11.5-1.  The  steering  laws  are  evaluated 
with  respect  to  one  another  by  applying  them  to  a  common  set  of  equations 
of  motion.  The  principal  conclusions  are: 


•  In  the  presence  of  target  maneuvers,  optimal  steering 
laws  that  include  a  term  proportional  to  measured 
target  acceleration  offer  substantial  improvement  in 
guidance  accuracy  over  those  that  do  not,  as  indicated 
in  Fig.  11.3-15. 

•  Optimal  steering  laws  that  correctly  include  the  effects 
of  autopilot  dynamics  offer  significantly  improved 
guidance  accuracy  for  steering  commands  that  are 
initiated  at  a  value  of  time-to-go  having  the  same  order 
of  magnitude  as  the  effebtive  autopilot  lag.  This  is 
illustrated  by  Figs.  11.3-14  and  11.3-15. 

•  If  autopilot  dynamics  are  imperfectly  known  (e.g. ,  if  a 
second  order  autopilot  having  a  low  damping  factor  is 
approximated  by  a  first,  order  lag)  the  performance  of 
the  optimal  steering  law  may  be  significantly  degraded 
from  that  predicted  by  analysis. 

•  From  the  standpoint  of  implementation,  the  improvement 
in  performance  obtained  by  measuring  target  accelera¬ 
tion  may  justify  the  added  computation  required  in  the 
guidance  laws,  as  compared  with  conventional  proporr 
tional  guidance.  However,  the  additional  complexity 
required  to  include  aiiopilot  or  uncompensated  airframe 
dynamics  in  the  guidance  law,  even  if  accurately  known, 
is  probably  not  worthwhile  unless  control  actuation 
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TABLE  11.5-1 

SUMMARY  OF  OPTIMAL  STEERING  LAWS 


Conditions  for  Optimality 

Steering 

Laws* 

Maneuvering 

Target 

Neglect 

Autopilot 

Dynamics 

First/Order 

Autopilot 

Second/Order 

Airframe 

Minimize 

T 

Xl(T)2+rJtac(T^dr 

Minimize 

T 

r  (T)2-r  (  6(t)2  dr 
t 

a.  (t) 

C1 

X 

X 

X 

a  (t) 
c2 

X 

X 

X 

ac  (t) 
c3 

X 

X 

ij(t) 

X 

X 

X 

62(t) 

X 

X 

_ 

X 

»c(t)  =  commanded  normal  acceleration 
fl(t)  =  commanded  control  surface  deflection 


x^r),  *j(t)  =  terminal  miss  distance 
r,  rc  -  weighting  constants 


*See  Eqs.  (il.  2-19),  (11.2-24),  (11.2-28),  (11.4-4)  and  (11.4-6). 
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effort  is  a  critical  consideration;  a  more  practical 
method  for  improving  guidance  accuracy  is  to  pre- 
design  the  best  possible  autopilot. 

•  Given  a  set  of  candidate  steering  laws,  it  is  possible 
to  generate  graphical  displays  of  performance  data 
using  linear  sensitivity  analysis.  The  plots  permit 
rational  comparisons  of  the  techniques,  based  on 
guidance  accuracy  and  control  effort  expended.  This 
information,  considered  together  with  the  compu¬ 
tational  complexity  of  each  steering  law,  will  aid 
in  making  a  specific  selection. 

The  above  conclusions  should  be  regarded  as  tentative,  subject  to  further 
refinement  in  an?  articular  application,  after  considering  effects  of  random 
measurement  noise,  control  level  and  normal  acceleration  limiting,  and 
time-varying,  random  or  intelligent  target  maneuvers. 

The  above  observations  have  a  direct  bearing  on  missile  guidance 
system  design.  First  of  all,  the  optimal  steering  laws  use  time-varying 
feedback  gains  which  may  require  more  computer  storage  than  is  available 
for  a  particular  application.  Therefore  a  designer  might  choose  to  use 
constant  feedback  gains  instead;  if  so,  the  optimal  design  can  be  used  as  a 
standard  of  comparison  to  evaluate  the  suboptimal  system. 

If  the  effoirt  expended  by  the  missile  control  actuators  is  an 
important  consideration,  there  is  some  advantage  in  including  the 
dynamics  of  autopilot  (a^t)  in  Section  11.2-19)  or  the  uncompensated  air¬ 
frame  (6j(t)  in  Section  11.4)  in  the  guidance  problem  formulation.  The 
resulting  optimal  guidance  law  effectively  provides  an  autopilot  having 
time-varying  feedback  gains  that  tend  to  increase  as  the  range-to-go 
decreases.  Thus  over  a  long  trajectory,  some  saving  in  control  effort  is 
achieved  because  the  autopilot  bandwidth  is  large  only  near  tike  end  of  the 
trajectory.  However,  if  the  actuator  power  supply  is  more  than  sufficient 
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for  the  mission  under  consideration,  it  is  reasonable  to  predesign  the 
autopilot  to  have  as  good  a  response  as  possible  at  all  flight  conditions 
(using  any  of  the  methods  discussed  elsewhere  in  this  report)  and  then  to 
utilize  a  linear  guidance  law  that  neglects  the  resulting  autopilot  dynamics. 

Another  important  consideration  is  the  conclusion  that  autopilot 
or  airframe  dynamics  should  be  accurately  modeled  if  they  are  to  be 
included  in  an  optimal  steering  law;  otherwise  the  performance  benefit 
expected  from  the  more  complex  steering  law  (e.  g. ,  compare  the  com¬ 
plexity  of  Eqs.  (11. 2-19)  and  (11.2-24))  may  not  be  realized.  Therefore 
when  flight  conditions  are  time -varying  an  adaptive  capability  --  either 
for  identifying  airframe  parameters  or  for  producing  an  adaptive  autopilot 
that  has  predictable  dynamics  --is  desirable. 

Finally,  it  may  be  feasible  to  solve  the  problem  of  achieving 
good  guidance  accuracy,  as  it  is  related  to  the  autopilot  response  time,  by 
using  rapidly  acting  missile  control  mechanisms  such  as  rotatable  wings. 
The  latter  would  overcome  the  inherently  slower  response  characteristics 
of  the  conventional  tail-controlled,  fixed-wing  vehicle. 
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APPENDIX  A 

LINEAR  DYNAMICAL  SYSTEMS 


A.  1  STATE  SPACE  NOTATION 

Almost  all  of  the  recent  advances  in  control  theory  —  work  by 
Pontryagin,  Bellman,  Liapunov,  Kalman  and  others  —  are  formulated  in 
state  space  notation.  In  addition,  this  manner  of  stating  a  problem  keeps 
it  closer  to  physical  reality  than  the  classical  transform  techniques. 

A  Dynamic  System  —  The  dynamics  of  a  linear  system  can  be 
represented  by  a  first  order  vector-matrix  differential  equation  in  which 
x(t)  is  the  system  state  vector  and  u(t)  is  a  forcing  function,  viz: 

x(t)  =  F(t)  x(t)  +  G(t)  u(t)  (A-l) 

This  is  the  continuous  form  employed  in  most  modem  control  theory. 
Figure  A-l  illustrates  the  equation.  The  state  vector  of  a  dynamic  sys¬ 
tem  is  composed  of  any  set  of  quantities  sufficient  to  completely  describe 
the  unforced  motion  of  that  system;  given  the  state  vector  at  a  particular 
point  in  time  and  a  history  of  the  system  forcing  function,  the  state  at  any 
other  time  can  be  computed.  The  state  vector  is  not  necessarily  a  unique 
set  of  variables;  several  sets  may  be  able  to  fulfill  the  above  requirement. 

Given  a  high-order  linear  differential  equation, 

(Dn  +  an-1(t)  Dn_1  +  .  .  +  aj(t)  D  +  aQ(t))  y(t)  =  f(t) 

where  D3  =  dVdti,  we  can  define  a  set  of  state  variables  Xj(t), . .  ,x  (t) 
by 
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Figure  A-l  Illustration  of  Continuous 
Representation  of  Linear 
Dynamic  Equations 


Xj(t)  =  y(t) 

Xg(t)  =  Xj(t) 


■  Vl<‘> 

These  variables  can  be  used  to  write  the  high  order  differential  equation 
as  a  set  of  first  order  linear  differential  equations: 

*l(t)  =  ^(t) 

X2(t)  =  Xg(t) 


xn(t)  =  -ao(t)Xl(t)  -ai(t)x2(t)-.  .  .  -an_1(t)xn(t) +f(t) 
Or,  in  the  form  of  Eq.  (A-l); 
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L*»  J 

L  o  1  2  n-2  n-1 

X 

n 

f 

Equation  (A-2)  is  illuatrated  in  block  diagram  form  in  Fig.  A-2.  Notice 
that  the  state  variables  are  in  each  case  the  outputs  of  integrations. 


D-223 


in  many  linear  systems  of  interest  the  forcing  i  .mction  is  multi¬ 
variable;  that  is,  u(t)  in  Eq.  (1)  is  composed  of  several  nonzero  functions. 
Also,  the  individual  elements  of  u(t)  may  drive  several  state  variables 
simultaneously,  causing  G(t)  to  be  a  matrix  with  significant  elements  at 
locations  not  on  its  diagonal.  In  these  cases  the  system  dynamics  may  be 
determined  directly  from  the  physical  description  of  the  problem. 
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For  example,  a  block  diagram  of  the  physical  system  may  be  sketched  and 
the  first  order  vector-matrix  differential  equation  determined  by  inspec¬ 
tion.  Figure  A- 3  illustrates  a  hypothetical  linear  dynamic  system  forced 
by  several  inputs.  The  outputs  of  all  the  integrators  constitute  a  conven¬ 
ient  set  of  state  variables.  The  system  dynamic  equations  can  be  written 
in  the  form  of  Eq.  (A-l); 
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Reference  34  demonstrates  the  steps  required  to  convert  a  higher  order 
differential  equation  into  a  set  of  state  variables  driven  by  a  multi- 
variable  forcing  function. 
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Figure  A-3  Block  Diagram  of  a  Hypothetical 
Linear  Dynamic  System 
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A.  2  THE  TRANSITION  MATRIX 

The  homogeneous  unforced  matrix  differential  equation  corres¬ 
ponding  to  Eq.  (A-l)  is 

x(t)  =  F(t)  x(t)  (A-3) 

Suppose  that  at  some  time,  r,  all  but  one  of  the  outputs  of  the  integrators 
in  Fig.  A-3  were  set  to  zero  and  no  inputs,  u^,  are  present.  Also  assume 
that  the  nonzero  integrator  output  was  given  the  value,  one.  The 
behavior  of  the  state  vector  for  all  times  after  r  could  be  expressed  in 
terms  of  a  time-varying  "solution  vector,"  cp^(t,  r),  where  the  subscript 
refers  to  the  integrator  whose  output  is  nonzero. 

*  /X.  \  I 

£j(t,T)  = 

If  the  initial  condition  on  the  j"1  integrator  is  something  other 
than  unity,  c  for  example,  then  from  the  linear  nature  of  the  system 

£.(t,T,c)  =  C<fl  (t,T) 

J  ~  J 

Also,  if  integrators  i  and  j  both  have  nonzero  outputs  a  and  Cj  at  time  r, 
the  response  for  the  system  is  just  the  sum  of  the  individual  response 
vectors 

JPi  j(^>T,Cj,Cj)  -  C^<g^(t,T)  +  Cj(£j(t,  t) 

But  this  can  be  written  as  a  product  of  the  matrix 

[fii :  *j] 


xn(t’T)j 


(A-4) 
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and  the  vector 

-  • 

c. 

1 

Cj 

L.  J 

In  general  every  integrator  can  have  a  nonzero  value  at  r  and  the  time 
history  of  the  state  becomes  the  sum  of  the  individual  effects 


X(t)  =  £<g^(t,  t)  !  <^(t,T) ! .  .  .  jign(t,r)j  x(r) 

which  is  written  for  compactness  as 

X(t)  =  $(t,T)x(T) 

The  matrix  $(t,r)  is  called  the  transition  matrix  for  the  system  of 
Eq.  (A-3).  The  transition  matrix  allows  calculation  of  the  state  vector 
at  some  time,  t,  given  the  state  vector  at  t. 


Returning  to  Eq.  (A-4),  it  can  be  seen  that  the  solution  vectors 
obey  the  differential  equation 


where 


« -  *  Wjjfc* 


?j(r,r)  -  e.j 
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Therefore,  the  transition  matrix,  composed  of  the  vectors  <o.,  obeys  the 
equation 

3*(t,r)  =  F(t)#(t,r) 

*  I 

The  transition  matrix  $(tj,t0)  relates  x(tj_)  to  x(t0)  inEq.(A-3); 


Similarly, 


Therefore, 


x(t][)  =  $(tl5t0)x(t0) 

x(t2)  =  x(tx) 

=  $(t2,ti)  ^(t^toJxCto) 

*(t2,t0)  =  S^tj)  ^(t^to) 


This  principle  is  illustrated  in  Fig.  A-4  for  a  first  order  system  having  a 
scalar  transition  matrix,  <p(t ,  t ) .  It  is  a  general  property  of  the  state 
transition  matrix,  independent  of  the  order  of  t0,  t^  and  t2  in  time.  In 
addition , 

$(t,t)  =  $(t,T)$(r,t)  =  I 

Premultiplying  this  expression  by  4>~*(t,  r)  provides  the  useful  relation 

<J>  ^(t, t)  =  $(t, t) 


Transition  Matrix  for  Stationary  Systems  —  For  a  stationary 
system,  the  F  matrix  is  time-invariant  and  the  transition  matrix  depends 
only  on  the  time  interval  considered: 

4>(t,T)  =  $>(t-r) 
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x(t2)  =  <p(t2,t1)x(t1) 


(t-t0)2 

x(t)  =  x(t0)  +  x(t0)(t-t0)  +  x(t0)  — gy—  +  .  .  . 
But 


x(tQ)  =  F  x(t0) 

*(t0)  =  Fx(to)  =  F2x(to) 

etc. 

Substituting,  the  expansion  becomes 

F2(t-tQ)2 

x(t)  =  x(t0)  +  F(t-to)  x(to)+ - 21 - x(t0)+.  .  . 

=  [i  4  F(t-t(j)  4  i.  F2(t-t0)24 ...  1  x(t0)  (A- 5) 
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By  analogy  to  the  scalar  case  the  matrix  exponential  is  defined  by 


F  ^  t  +  tt  + 

e  1  *  2!  3! 


Consequently,  the  transition  matrix  for  the  stationary  system  can 
identified  from  Eq.  (A- 5)  as 


(A- 6) 
be 


F(t-t0) 

<f>(t-t0)  =  e 

which  depends  only  on  the  stationary  system  dynamics  (F)  and  the  interval 

t-t0o 


In  stationary  systems,  t0  may  often  be  assigned  the  value  zero. 
A  useful  expression  for  the  transition  matrix  $>(t,0)  can  be  obtained  by 
taking  Laplace  transforms  in  Eq.  (A-3).  Defining  X(s)  as  the  transform 
of  x(t),  one  obtains 

sX(s)  =  FX(s)  +  x(0) 

or  alternatively 

X(s)  =  (Is  -  F)"1  x(0)  (A- 7) 

Comparison  of  Eq.  (A-7)  with  the  expression 

x(t)  =  $>(t,0)x(0) 

implies  that 

4>(t,0)  -  L-1  |(Is  ~  F)_1| 

where  L  f  1  denotes  he  inverse  Laplace  transform.  Defining  $(s)  to 
be  the  Laplace  transform  of  <f>(t,  0),  one  has 

A 

$(s)  =  (Is  -  F)"1  (A-8) 
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A. 3  THE  MATRIX  SUPERPOSITION  INTEGRAL 


Consider  now  the  general  linear  system  of  Eq.  (A-l),  including 
the  forcing  function  u(t); 

x  =  Fx  +  Gu  (A-9) 

Referring  to  Fig,  A-5  we  can  see  that  the  effect  of  the  input  to  the  ith  inte¬ 
grator  of  Fig.  A- 3  over  a  small  interval  (r  -  At,  t)  car*  be  represented  as  an 
impulse  whose  area  is  the  value  of  the  i^  element  of  the  vector  G(t)u(t)  times 
the  interval  At.  Temporarily  assuming  that  the  initial  conditions  on  all  the 
integrators  are  zero,  this  impulse  will  cause  a  small  change  Axi  in  the 
integrator  output; 

Ax.(t)  =  (G(t)  u(t)^  At 

The  change  in  the  entire  state  vector  can  be  expressed  as 


Ax(t)  = 


G(t)  u(t)  At 


The  effect  of  this  small  change  on  the  state  at  some  subsequent  point  in 
time  can  be  expressed  by 

Ax(t),  given  an  impulse  input  G(t)  u(t)  At  =  $(t,  t)  G(t)  u(t)  At 

Because  the  system  is  linear  the  response  to  the  complete  input  history 
can  be  viewed  as  the  sum  of  the  responses  to  individual  impulses.  In  the 
limit  as  At  -  0  the  effect  of  the  input  on  the  state  at  some  time,  t,  can  be 
represented  by 

rt 

x(t)  =  \  $(t,  t )  G(t )  u(t )  dr 
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R-226 


Figure  A-5  Representation  of  the  Input  to 
the  i*h  Integrator  as  an  Impuice 


Now  because  the  system  is  linear,  the  effects  of  initial  conditions  at 
time  t0  and  the  presence  of  a  forcing  function  after  time  t0  can  be  com  ¬ 
bined  to  yield 

A 

x(t)  =  $(t,t0)  x(t0)  +  \  $(t ,t)  G(t)  u(t)  dT  (A-1G) 

The  integral  in  Eq.  (A- 10)  is  often  called  the  matrix  superposition  integral. 
Of  possible  use  in  solving  Eq„  (A- 10)  is  the  relation 
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The  derivative  is  given  by 

A  *(t  T)  =  Um  »(t,T)-$(t?  T-AT), 

dr  '  ’  At  -•  0  At 

_  lim  -»(t,  r)  F(t)  (At)  +Q(At2) 
At-0  At 

=  -  *(t,  t)  F(t) 


A. 4  CONTROLLABILITY  AND  OBSERVABILITY 

Consider  a  linear  dynamical  system  for  which  a  set  of  linear 
measurements  y(t)  is  defined  by 

x(t)  =  F(t)  x(t)  +  G(t)  u(t) 

Z(t)  =  H(t)  x(t)  (A- 11) 

The  measurements  are  those  quantities  which  can  be  directly  observed  at 
various  system  output  "terminals"*.  For  this  system  of  equations,  u(t) 
is  the  control  and  y(t)  is  the  observation.  Two  fundamental  properties  of 
the  system,  related  to  these  sets  of  variables  are  defined  as  follows: 

A  system  is  controllable  at  time  tj  >  tg  if  it  is  possible 
to  choose  u(t)  in  the  interval  tg  *t  stj  to  "drive"  any 
state  x(tg)  =  £  to  any  point  x(tj)  =  y. 

A  system  is  observable  at  time  tj  >  tg  if  it  is  possible 
to  determine  the  state  x(tg)  by  observing  y(t)  in  the 
interval  tg  s  t  s  tj. 

Controllability  determines  whether  one  can  achieve  any  desired  state  by 
manipulating  u(t)  and  observability  determines  whether  one  can  determine 

the  value  of  the  state  at  any  time  by  measuring  y(t). 

* 

This  terminology  is  suggested  by  the  input  and  output  terminals  of 
electric  networks. 
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Conditions  which  determine  whether  the  linear  system  in 
Eq.  (A-ll)  has  these  properties  are  determined  by  the  algebraic  proper¬ 
ties  of  the  matrices  F(t),  G(t),  and  H(t)  (Ref.  30).  For  constant  systems, 
necessary  and  sufficient  conditions  are : 


A  constant,  n^1  order,  linear,  dynamical  system  is 
controllable  if,  and  only  if,  the  matrix 

F11"1  g]  (A- 12) 


©  =  i 


i  i  2  1 
G I FG i F  G ! 

i  i  1 


has  rank*  n. 

A  constant  n^1  order  linear  dynamical  system  is 
observable  if,  and  only  if,  the  matrix 

A  ==  |hT!fTET!  (Ft)2HT|  *  •  •  |  (FT)n"1HTj  (A- 13) 

has  rankn. 


Note  that  because  the  system  is  time  invariant,  the  time  t ^ , 
does  not  appear  in  Eqs.  (A-12)  and  (A-13),  implying  that  t^  is  arbitrary 
so  long  as 

‘l  >  *0 

This  concludes  a  brief  summary  of  the  important  properties  of 
linear  systems  used  in  this  report.  The  reader  is  referred  to  Refs.  30 
and  34  for  more  details  on  the  subject. 


That  is,  at  least  one  set  of  n  columns  of  ©  form  a  set  of  linearly 
independent  vectors. 
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APPENDIX  B 

OPTIMAL  CONTROL  OF  LINEAR  SYSTEMS 


B„1  INTRODUCTION 

The  basic  task  faced  by  a  control  system  designer  can  often  be 
expressed  in  terms  of  the  solution  x(t)  to  a  set  of  first  order  differentia] 
equations 

x(t)  =  f[x(t),  u(t),  t];  x(t0)  =  x 0  (B-l) 

The  vector  x(t)  is  called  the  state  of  the  system  and  u(t)  is  a  vector  set  of 
control  variables  which  can  be  specified  by  the  designer.  The  objective  is 
to  determine  u(t)  such  that  x(t)  achieves  some  desired  behavior  subject  to 
the  cause -effect  relationship  provided  by  Eq.  (B-l).  For  example,  it  may 
be  required  that  certain  elements  of  the  state  vector  take  on  specified 
values  at  a  given  terminal  time  tf,  or  that  the  solution  to  Eq.  (B-l)  (also 
referred  to  as  the  trajectory  of  the  state)  should  possess  certain  stability 
properties. 

Generally  speaking,  design  criteria  that  are  related  only  to  the 
behavior  of  x(t),  are  not  sufficient  to  determine  a  unique  control  u(t). 

Clearly  if  one’s  purpose  is  to  transfer  the  state  from  one  point  x(t0)  =  x0 
in  ’’state  space”  to  another  x(tf)  =  x^  without  any  specification  on  the  path 
taken  by  x(t)  or  limits  upon  the  control,  then  there  are  likely  to  be  an 
infinite  number  of  paths  and  control  functions  that  can  be  employed.  Con¬ 
sequently  additional  design  requirements  can  be  accommodated. 

Usually  it  is  true  that  the  implementation  of  different  controllers, 
all  of  which  satisfy  the  desired  criteria  on  the  state,  costs  different  amounts 
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of  some  important  measure  of  acceptability,  —  e.g.,  system  complexity, 
weight,  and/or  volume.  For  example,  a  controller  for  a  self-contained 
tactical  missile  guidance  system  that  requires  a  large  computer  to 
mechanize  u(t)  would  be  unacceptable  with  respect  to  all  of  these  charac¬ 
teristics.  Often  this  "cost,"  although  quite  recognizable  in  the  resulting 
system  design,  is  difficult  to  describe  mathematically.  Therefore  it  may 
not  be  possible  to  include  it  in  the  design  criteria  in  any  systematic 
fashion.  In  such  circumstances  a  satisfactory  mechanization  is  achieved 
using  a  combination  of  intuition,  experience,  and  trial  and  error.  In 
other  situations  the  cost  can  be  expressed  as  a  mathematical  quantity;  a 
common  example  is  a  time  integral,  such  as 


Expressions  of  this  type  may  represent  the  total  amount  of  fuel  or  energy 
consumed,  in  which  case  it  is  desired  to  keep  their  values  small.  For 
this  case  a  reasonable  design  procedure  is  to  determine  the  control  so 
that  the  associated  cost  is  minimized,  subject  to  any  constraints  on  the 
behavior  of  x(t)  or  u(t).  The  solution  to  a  problem  posed  in  this  fashion, 
called  an  optimal  control  problem,  is  referred  to  as  an  optimal  control 
law;  it  is  the  best  control  to  use  for  the  particular  design  criteria 
selected  generally,  an  optimal  control  law  is  one  which  minimizes 

a  specified  performance  index,  defined  as  a  functional  of  the  state 
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and  the  control,  while  satisfying  the  specific  design  constraints.  Thus  an 
optimal  control  problem  provides  a  means  for  selecting  one  control  law 
from  the  many  which  are  permissible  when  there  is  no  performance  index. 

Another  property  one  desires  to  have  in  any  controller  design 
is  feedback;  that  is,  the  optimal  control  should  be  explicitly  a  function  of 
the  state  x(t).  When  it  has  this  form,  u(t)  is  referred  to  as  a  closed  loop 
or  feedback  control.  This  arrangement  is  needed  because  there  are 
always  forces  or  disturbances  acting  upon  a  physical  system  which  are  not 
accounted  for  in  the  mathematical  model  given  by  Eq.  (B-l).  These  cause 
the  state  to  deviate  from  the  path  which  is  predicted  by  integrating 
Eq.  (B-l)  with  a  specified  u(t).  If  the  latter  is  explicitly  a  function  of  time 
only  (i.e.  it  is  an  open  loop  control)  the  controller  never  senses  the  effect 
of  disturbances  on  the  state,  and  the  latter  may  drift  away  from  its 
desired  path,  possibly  in  an  unstable  fashion. 

The  above-mentioned  considerations  are  important  in  any  con¬ 
trol  system  design  problem.  This  appendix  presents  analytical  details  for 
formulating  and  obtaining  a  feedback  solution  for  a  special  kind  of  optimal 
control  problem  associated  with  linear  dynamical  systems.  Basic  text¬ 
book  references  for  this  subject  are  Athans  and  Falb  (Ref.  97)  and 
Bryson  and  Ho  (Ref.  137). 


B.2  THE  LINEAR  OPTIMAL  REGULATOR 

A  linear  dynamical  system  is  described  by  a  differential  equa¬ 
tion  of  the  form 

x(t)  =  A(t)  x(t)  +  B(t)  u(t);  x(tQ)  =  xq  (B-2) 
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where  x(t)  is  a  set  of  state  variables  which  describe  the  response  of  the 
system  to  a  set  of  control  inputs  u(t).  An  important  optimal  control 
problem  associated  with  this  system  requires  the  selection  of  u(t)  such 
that  the  "quadratic"  performance  index 

tf 

Jfx(t),u(t)j  =  x(tf)T  Fx(tf)+  J  j^x(t)TQ(t)x(t)  +  u(t)T  R(t)u(t) j  dt  (B-3) 

*o 

is  minimized  for  a  given  terminal  time  tf.  The  weighting  matrices  F, 
Q(t),  and  R(t)  are  to  be  specified  by  the  designer  where  F  and  Q(t)  are 
positive  semidefinite  and  R(t)  is  positive  definite.* 

Physically  spealcing,  the  terms  in  J  involving  the  state  provide 
a  measure  of  the  magnitude  of  x(t)  both  at  the  termination  time  and  during 
the  interval  (t0,tf).  The  term  involving  u(t)  is  a  measure  of  the  control 
effort  expended.  The  objective  is  to  make  x(t)  small  in  the  sense  defined 
by  the  magnitudes  of  the  individual  indices 

Jf  ~  x(tf)T  Fx(tf) 
t. 

Jx=  ^  x(t)T  Q(t)  x(t)  dt  (B-4) 


* 

A  positive  semidefinite  matrix  F  is  one  for  which  the  condition 
xTFx  a  0 

holds  for  all  vectors  x.  If  this  condition  holds  with  strict  inequality  for 
all  x  i  0,  F  is  said  to  be  positive  definite.  The  quantity  xTFx  is  known 
as  a  quadratic  form. 
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This  is  to  be  accomplished  with  the  competing  requirement  that  excessive 
control  leveis  not  be  used,  as  measured  by 


(B— 5) 


The  total  performance  index  is  the  sum  of  Jf,  J  and  J  ;  therefore  mini- 
mization  of  J  prevents  any  one  of  the  individual  indices  from  being  too 
large  at  the  expense  of  the  others. 

The  signifies  ce  of  using  quadratic  forms  to  measure  the  size 
of  the  control  and  the  state  is  that  they  heavily  penalize  large  values  of 
x(t)  and  u(t).  In  addition,  the  squares  of  state  and  control  variables  are 
often  identified  as  power  and  their  integral  square  values  are  interpreted 
as  energy;  e.g.,  Ju  in  Eq.  (B-5)  can  be  a  measure  of  energy  expended. 

An  additional  nonincidental  reason  for  the  popularity  of  quadratic  perform¬ 
ance  indices  is  that  the  solution  for  the  optimal  feedback  control  is  readily 
derived  and  it  is  a  linear  function  of  the  state;  hence  it  is  relatively  easy 
to  mechanize.  Other  types  of  optimal  design  criteria  often  permit  a  solu¬ 
tion  for  the  open-loop  optimal  control  but  the  corresponding  feedback  con¬ 
trol  is  usually  not  readily  obtained. 

The  solution  to  the  optimal  control  problem  described  above  is 

given  by 

u(t)  =  -  R(t)'1  B(t)T  S(t)  x(t)  (B-6) 

where  S(t)  is  the  solution  to  a  matrix  Riccati  differential  equation, 

S(t)  =  -  S(t)  A(t)  -  A(t)T  S(t)  +  S(t)  B(t)  R(tf 1  B(t)T  S(t)  -  Q(t) 

s(t{)  =  F  (B-7) 
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The  matrix  S(t)  can  be  determined  by  integrating  Eq.  (B-7'  backwards 
from  the  terminal  time.  Then  a  set  of  time- varying  gains, 

K(t)  =  H_1(t)  B(t)T  S(t),  (B-8) 

can  be  calculated  and  stored  in  a  controller  to  implement  the  feedback 
control  law 


u(t)  =  -  K(t)  x(t);  to  ^  t  s  tf  (B-9) 

In  this  form  the  control  is  optimal  for  all  time  in  the  interval  (t0,tf) 
with  x(t)  regarded  as  the  initial  state.  The  value  of  the  performance 
index  for  any  initial  condition  x(t)  at  time  t  s  t0  evaluated  over  the  interval 
(t,  tf)  is  given  by 

J  =  x(t)T  S(t)  x(t) 

From  the  standpoint  of  implementation,  observe  that  Eq.  (B-9) 
assumes  all  the  elements  in  the  state  vector  are  available  to  generate 
u(t).  Often  in  a  practical  situation  some  state  variables  cannot  be  mea¬ 
sured  directly;  instead  there  may  be  a  set  of  measurements  m(t)  related 
to  x(t)  by  a  linear  transformation 

m(t)  =  H(t)  x(t)  (B-10) 

To  implement  the  optimal  control  law,  x(t)  must  be  obtained  from 
m(t)  as  accurately  as  possible.  Methods  far  accomplishing  this 
task  when  the  system  of  equations,  Eqs.  (B-2)  and  (B-10),  is  observable 
(see  Appendix  A)  are  discussed  in  Refs.  138,  139,  and  140.  The  result  is 
that  an  estimate  x(t)  of  the  state  can  be  obtained  by  implementing  a  multi¬ 
dimensional  ’’filter"  defined  by  differential  equations  of  the  form 
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where  F(t)  and  G(t)  are  derived  either  from  the  principles  of  estimation 
theory  (Ref.  138)  or  from  linear  observer  theory  (Refs.  139  and  140). 
Consequently  when  all  state  variables  are  not  directly  measurable  (H(t) 
in  Eq.  (B-10)  cannot  be  inverted),  the  need  to  implement  an  expression 
such  as  Eq.  (B-ll),  as  well  as  Eq.  (B-9)  with  x(t)  substituted  for  x(t), 
complicates  the  structure  of  the  controller. 


B.3  CHOICE  OF  PERFORMANCE  WEIGHTING  MATRICES 

A  critical  step  in  defining  the  optimal  control  problem  is  the 
selection  by  the  designer  of  the  weighting  matrices  F,  Q(t)  and  R(t)  in 
Eq.  (B-3).  There  is  no  direct  procedure  for  accomplishing  this  task;  it 
embodies  all  of  the  subjectivity  remaining  in  the  design  problem.  The 
elements  in  any  one  matrix  can  be  selected  with  respect  to  each  other  on 
the  basis  of  the  relative  importance  of  various  products  of  state  vari¬ 
ables,  as  indicated  in  Ref.  137.  Often  none  of  the  cross  products  — 
x^t)  xj(t)  or  Uj(t)  Uj(t),  i  i  j  —  in  the  quadratic  forms  are  of  interest  so 
that  the  off-diagonal  terms  in  the  weighting  matrices  are  taken  as  zero. 
The  diagonal  elements  of  each  matrix  can  be  sized  according  to  the  de¬ 
sired  relative  magnitudes  of  the  elements  in  x(t)  and  u(t).  For  instance 
if  the  desired  magnitudes  of  Xj(tf)  and  xi(t^)  are  known  relative  to  each 
other,  say 


X^tf)* 


(B-12) 


then  require  that  the  diagonal  elements  f..  and  f..  of  F  satisfy 

11  JJ 

fii  _  xi(tf>2  .. 
fjj  X.(tf)2 


(B-13) 
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i 


This  determines  f..  relative  to  f^;  hopefully  the  resulting  optimal  control 
will  operate  to  yield  values  of  the  terminal  state  which  approximately 
satisfy  Eq.  (B-12).  Relative  values  of  the  elements  in  two  different  weight¬ 
ing  matrices  can  be  chosen  from  rough,  a  priori  knowledge  of  the  mag¬ 
nitudes  of  the  penalty  terms  Jf,  Jx  and  Ju  which  might  result  in  a  reason¬ 
able  case .  For  example ,  suppose  it  is  known  that  the  level  of  control 
Uj(t)  required  to  yield  a  desired  value  of  Xj(tf)  has  a  magnitude  on  the  oraer 
of  Mj.  Then  let 


%  =  W 


(B-14) 


Using  some  heuristic  method  of  this  type,  one  arrives  at  trial  values  of 
F,  Q(t)  and  R(t).  If  the  resulting  optimal  control  yields  trajectories  for 
x(t)  and  u(t)  which  appear  unsatisfactory,  adjustments  can  be  made  to  the 
weighting  matrices  to  change  the  system  behavior. 

The  fact  that  trial  and  error  are  required  to  determine  appro¬ 
priate  weighting  matrices  for  the  performance  index  tends  to  contradict 
the  basic  philosophy  of  optimal  control.  The  latter  implicitly  c  cm  tains  the 
idea  that,  given  required  constraints  upon  the  state  and  control  variables 
of  a  dynamical  system,  a  performance  index  which  accurately  represents 
the  cost  of  designing  and  implementing  the  system  can  be  devised.  It  is 
desired  that  this  index,  and  no  other,  be  minimized;  however,  in  prac¬ 
tical  applications  considerations  often  arise  which  modify  this  design 
concept.  In  particular,  frequently  one  has  certain  performance  criteria 
in  mind  which  cannot  be  easily  incorporated  into  the  mathematical  formu¬ 
lation  of  an  optimal  control  problem.  As  an  illustration  ncte  that  the  per¬ 
formance  index  in  Eq.  (B-3)  is  a  weighted  sum  of  several  different  cost 
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factors  which  compete  with  one  another.  The  designer  may  be  primarily 
interested  in  the  values  of  individual  tor  ms,  such  as 

A 

\  u.(t)2  dt 

‘o 

Each  term  of  this  type  is,  for  the  optimal  control  situation,  dependent 
upon  the  weighting  matrices  used  to  define  the  performance  index  and  it 
cannot  be  evaluated  without  actually  solving  the  control  problem  for  speci¬ 
fic  choices  of  F,  Q(t)  and  R(t).  Thus  several  trial  designs  may  be  needed 
before  the  designer  is  satisfied  with  the  collective  behavior  of  the  state 
and  control  variables  and  some  compromise  in  his  "true"  objectives  may 
be  required.  Other  examples  of  design  criteria  that  are  difficult  to  treat 
mathematically  are  ranges  of  allowable  settling  time  and  overshoot  in 
response  to  step  commands.  In  these  cases  quadratic  performance  indices 
can  also  be  used  as  an  artifice  to  generate  a  family  of  trajectories  from 
which  one,  having  the  additional  desired  features,  is  selected.  In  view  of 
these  contingencies,  optimal  control  theory  for  linear  systems  using  qua¬ 
dratic  performance  indices  can  be  a  useful  design  aid  but  it  is  no  panacea. 


B.4  THE  TIME -INVARIANT  UNEAR  REGULATOR 

An  important  special  case  of  the  linear  optimal  regulator  des¬ 
cribed  in  Section  B.2  occurs  when  Eq.  (B-2)  and  the  weighting  matrices 
are  time  invariant  —  A{t)  and  B(t)  are  constant  matrices,  A  and  B  --  and 

F  =  0;  Q(t)  =  Q;  R(t)  =  R 

For  this  application  interest  is  focused  on  the  form  of  the  optimal  control 
law  as  tf  approaches  infinity.  Under  the  added  restriction  that  the  linear 
constant  system 
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x(t)  =  Ax(t)  +  Bu(t)  ;  x(tQ)  =  xQ  (B-15) 

is  controllable*  it  has  been  shown  (Ref.  141)  that  the  feedback  control 
which  minimizes 

oo 

J=  ^  [x(t)T  Qx(t)  +  u(t)T  Ru(t)]  dt  (B-16) 

‘o 

is  given  by 

u(t)  =  -R“1BTSx(t)  (B-17) 

The  constant  matrix  S  is  the  unique  positive  semidefinite  solution  to  the 
"steady  state"  matrix  Riccati  equation** 

SA  +  ATS  -  SBR-1  BTS  +  Q  =  0  (B-18) 

The  feedback  gain  matrix 

K  =  R'1  BTS, 

is  also  constant,  requiring  much  less  storage  than  the  time -dependent 
controller. 

Perhaps  the  most  important  property  of  the  above  control  law 
is  that  the  closed  loop  system 

x(t)  =  (a  -  BR-1  BTs)  x(t)  (B-19) 

♦ 

See  Appendix  A  for  a  definition  of  controllability. 

When  any  eiger  alues  of  A  have  nonnegative  real  parts,  some  additional 
restrictions  on  the  weighting  matrix  Q  are  required  (Eef.  147)  in  order 
that  there  be  a  unique  positive  semidefinite  solution  to  Eq.  (B-17). 
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is  asymptotically  stable  in  the  large  (Ref.  141).  Hence  even  though  there 
is  some  subjectivity  in  the  choice  of  Q  and  R,  various  trial  values  of  the 
latter  always  lead  to  a  set  of  feedback  gains  for  which  the  system  has 
desirable  stability  properties.  When  the  dimensions  of  the  state  and  con¬ 
trol  are  large,  solving  Eq.  (B-18)  is  a  convenient  method  for  selecting 
various  values  of  K  which  have  this  property. 


B.  5  VARIATIONS  ON  THE  LINEAR  OPTIMAL  REGULATOR 

One  objection  to  the  optimal  regulator  problem  as  posed  in 
Sections  B.2  and  B.4  is  that  the  control  law  requires  knowledge  of  all 
state  variables.  If  a  filtering  operation  as  suggested  in  Eq.  (B-ll)  must 
be  performed  on  the  available  measurements  to  accomplish  this  task,  the 
system  complexity  increases.  To  avoid  this  difficulty  it  may  be  desir¬ 
able  to  formulate  an  optimal  control  problem  which  requires  only  par¬ 
tial  state  feedback.  This  is  done  simply  by  requiring  that  a  control  u(t) 
having  the  restricted  form 

u(t)  =  -  [Kr(t)  |  [0]]  x(t)  (B-20) 

be  determined  such  that  a  performance  index  J  (either  Eq.  (B-3)  or  (B-16) 
is  minimized  subject  to  the  equations  of  motion;  e.  g. , 

x(t)  =  Ax(t)  +  Bu(t)  (B-21) 

The  zeros  in  the  gain  matrix  defined  in  Eq.  (B-20)  multiply  those  ele¬ 
ments  of  x(t)  which  are  not  available  for  feedback.  This  is  a  seemingly 
minor  modification  to  the  optimal  regulator  problem;  however  it  consider¬ 
ably  complicates  the  procedure  for  determining  the  feedback  control  law. 

A  numerical  method  for  determining  the  gains  Kr(t)  is  given  in  Ref.  142. 
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Another  technique  for  determining  a  restricted  set  of  feedback 
gains  is  given  in  Ref.  143.  Here  the  condition  on  u(t)  is  the  same  as 
Eq.  (B-20)  but  the  quadratic  performance  index  includes  cross  product 
terms  in  the  state  and  control.  This  leads  to  a  simpler  solution  for  Kr(t) 
but  the  physical  significance  of  the  cost  functional  becomes  obscurred  by 
the  special  techniques  used  to  select  certain  of  its  weighting  matrices. 


In  Section  B.  3  it  is  mentioned  that  trial  and  error  are  often 
required  to  design  the  optimal  regulator  because  the  performance  index 
is  really  a  weighted  sum  of  several  competing  indices.  To  eliminate  the 
need  to  select  weighting  constants,  it  Can  be  required  that  each  separate 
term  of  J  take  on  specified  values,  except  the  one  which  is  to  be  mini¬ 
mized.  This  results  in  an  optimal  control  problem  with  integral  con¬ 
straints.  To  illustrate  this  idea,  consider  the  problem  of  choosing  a 
control  u(t)  for  the  system 


x(t)  =  Ax(t)  +  Bu(t)  ;  x(t0)  =  x0 

such  that  the  performance  index 


J 


u 


Ru(t)  dt 


(B-22) 


(B-23) 


in  minimized  subject  to  the  constraint 


Jx  =  \  x{t)  Qx(t)  =  c 


(B-24) 


where  c  is  a  specified  constant.  For  this  illustration  we  take  F  =  0  in 
Eq.  (B-4).  The  motivation  for  this  problem  formulation  is  the  desire  to 
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insure  that  x(t)  is  ’’small  enough”  in  the  interval  (t0,tf).  Therefore  a 
specific  value  is  assigned  to  Jx  rather  than  including  it  in  the  perform¬ 
ance  index,  as  is  done  for  the  optimal  regulator  in  Eq.  (B-3).  Conse¬ 
quently  there  is  no  need  to  consider  what  the  "size”  of  Q  should  be  rela¬ 
tive  to  R. 


To  solve  the  above  problem  it  is  convenient  to  proceed  as  if  we 
want  to  minimize  the  quantity 


J  = 


Ju+X 


Q  x(t)  dt 


(B-25) 


where  X  is  a  weighting  constant  to  be  specified  presently;  for  the  moment 
we  ignore  Eq„  (B-24).  The  solution  to  this  problem  is  obtained  directly 
from  Section  B.2;  the  optimal  control  is  given  by 

u(t)  =  -  R"1  BT  S(t)  x(t)  (B-26) 


where  S(t)  satisfies 

S(t)  =  -  S(t)  A  -  ATS(t)  +  S(t)  B  R-1  BTS(t)  -  XQ  (B-27) 

S(tf)  =  0 

This  equation  can  be  solved  once  X  is  known.  If  X  can  be  chosen  so  that 
Eq.  (B-24)  is  satisfied,  the  solution  to  the  problem  with  J  constrained 
is  determined. 

Appealing  to  the  adjoint  theory  for  linear  systems*  we  know 

that  for  any  feedback  control  law 

*  ___________________ 

See  Appendix  G. 
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u(t)  =  -K(t)  x(t)  (B-28) 

applied  to  Eq.  (B-21),  the  value  of  J  .  caused  by  initial  conditions  x(t)  at 
any  time  t  is  determined  by  the  relation 

Jx  =  x(t)T  C(t)  x(t)  (B-29) 

where  C(t)  is  the  solution  to 

T 

C(t)  =  -  (a  -  B  K(tj)  C(t)  -  C(t)  ^A-  B  K(t)j  -  Q(t) 

C(tf)  =  0  (B-30) 

If  an  optimal  control  law  is  used  we  know  from  Eq.  (B-26)  that 

K(t)  =  R"1  Bt  S(t)  (B-31) 

Substituting  from  Eq.  (B-31)  into  Eq.  (B-30)  and  combining  with  Eq.  (B-26) 
yields  the  following  control  law  for  minimizing  the  performance  index  Jy, 


subject  to  the  constraint  on  J  in  Eq.  (B-24): 

A 

u(t) 

=  -  R"1  BTS(t)  x(t) 

S(t) 

=  S(t)  A  -  ATS(t)  +  S(t)  B  R_1  BTS(t) 

-XQ 

C(t) 

=  -  (a  -  BR"1BTS(t)^T  C(t)  -  C(t)  j 

!a-br" 

LBTS(t)^  -Q 

S(tf) 

n 

n 

ii 

o 

x(to)T 

c(t0)  x(t0)  =  c 

(B-32) 

Because  X  is  unknown,  Eq.  (B-32)  is  a  set  of  coupled  non¬ 
linear  differential  equations  with  boundary  conditions  at  both  end  points. 
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Consequently  they  must  generally  be  solved  by  some  numerical  method. 
An  iterative  type  procedure  can  be  used  as  follows: 

•  Guess  a  value  of  X . 

•  Integrate  the  differential  equations  for  S(t) 

and  C(t)  backward  from  t,  to  t  . 

fo 

•  Check  to  see  whether  the  expression 

xT(t0)  C(t0)  x(t0)  =  c 

is  satisfied.  If  so,  S(t)  yields  the  desired 
feedback  gains;  if  not,  choose  a  new  value 
of  X  and  repeat  the  above  steps. 

Successive  choices  of  X  can  be  made  by  any  convenient  finite -difference 
type  method. 

Because  the  last  expression  in  Eq.  (B-32)  depends  upon  the 
initial  value  of  the  state,  the  solutions  for  S(t)  and  K(t)  also  depend 
upon  x(t0),  making  the  control  law  partially  open  loop  in  character.  To 
avoid  this  situation  it  is  desirable  to  modify  the  condition  on  J  by 
normalizing  the  integral  in  Eq.  (B-24).  Thus  require  instead  That 


max 

x(t0) 


(  x(t)T  Qx(t) 
o 


x(t0)  x(t0) 


(B-33) 


Referring  to  Eq.  (B-29)  it  follows  that 
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J  =  max 
"  x(tQ) 

The  scalar  quantity  |C(tQ)|  is  referred  to  as  the  norm  of  the  matrix  C(t0); 
it  can  be  evaluated  from  the  relation  (Ref.  144) 

jc(t0)|  =  max  p.  (C(t0))  =  c  (B-35) 

i 

where  is  the  i  n  eigenvalue  of  C(t0). 

Replacement  of  the  fifth  expression  in  Eq.  (B-32)  by  Eq.  (B-35) 
eliminates  the  dependence  of  the  optimal  control  law  upon  the  initial  state; 
however  it  still  depends  upon  the  initial  time.  The  last  objection  is  elimi¬ 
nated  in  the  steady  state  situation  where  tj  approaches  infinity.  Just  as  in 
the  optimal  regulator,  the  corresponding  feedback  gains  become  constants 
which  satisfy 

u(t)  =  -  Kx(t) 

K  =  R"1  BTS 

sa+ats-sbr_1  bts+xq  =0 

(a  -  BR"1  BTs)T  C  +C  (a  -  BR"1  BTs)  +  Q  =  0 

max  p.(C)  =  c  (B-36) 

The  relations  in  Eq.  (B-36)  provide  a  method  for  choosing  con¬ 
stant  feedback  gains  so  that  Eq.  (B-34)  is  satisfied  as  tf  approaches 
infinity.  The  numerical  effort  required  to  determine  K  is  consider¬ 
ably  greater  than  for  the  optimal  regulator;  however  there  is  less  sub¬ 
jectivity  in  defining  a  performance  index.  The  extension  to  more  than  one 


x(t0)  C^o^o) 


x(t0)T  x(t0) 


|C(t0)|  =  c 


(B-34) 
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quadratic  integral  constraint  can  be  made  quite  readily  by  introducing 
more  ’’multipliers”  X.  However  each  new  constraint  requires  an  addi¬ 
tional  matrix  equation  similar  to  that  for  C  in  Eq.  (B-36)  and  an  addi¬ 
tional  eigenvalue  condition.  It  is  possible  that  the  conditions  leading  to 
the  matrix  norm  in  Eq.  (B-34)  can  be  changed  to  permit  a  simpler  com¬ 
putation  than  determining  the  maximum  eigenvalue  of  C.  This  procedure 
for  selecting  a  set  of  constant  feedback  gains  is  largely  experimental  at 
present  and  little  experience  with  the  resulting  system  performance  or 
the  feasibility  of  possible  modifications  in  the  problem  formulation  is 
currently  available.* 

This  concludes  a  summary  of  several  important  features  of 
modem  optimal  control  theory.  It  is  intended  only  as  a  review  of  the 
material  considered  in  this  report;  for  additional  information  the  reader 
is  referred  to  the  aforementioned  textbooks  (Refs.  97,  137)  and  the 
literature. 


j|C 

There  are  apparently  some  unanswered  questions  about  existence  of 
solutions  to  this  type  of  problem  formulation. 


B-17 


THE  ANALYTIC  SCIENCES  CORPORATION 


APPENDIX  C 


AN  ANALYSIS  OF  THE  M.I.T.  GRADIENT  METHOD 


This  appendix  is  concerned  with  an  examination  of  the  M.I.T. 
parameter  adjustment  rule  (Ref.  40),  to  determine  the  extent  to  which  it 
achieves  the  gradient  algorithm  described  in  Section  4.2  and  given  in 
Eq.  (4.2-10).  Recall  that  the  objective  is  to  change  a  parameter  value 
h.  by  an  amount  Ah^,  given  by 


t.+1  r 

Ah.  ft.)  =  -  cl  \  L  (t)  e.  (t)  i 

'  v  •  ■'  L) 


(C-l) 


where 


Vi-*j  =  T 


and  subscripts  e  and  denote  partial  differentiation  with  respect  to  e  and 
hj(tj)  (see  Eq.  (4.2-11)).  The  quantity  L[  e(t)]  is  a  positive  function  of  the 
error  signal.  As  an  alternative  to  Eq.  (C-l)  the  M.I.T.  rule  uses  the 
analog  gain  adjustment  algorithm 


\(t)  =  -«i  Le(t)  eh.(t) 


(C-2) 


The  actual  total  change  in  hi(t)  is  not  completed  until  time  t.+j  and  is  given  by 


#j+l)  -  -  ^  1,+1  Le(t)  V 


(t)  dt 


(C-3) 


with  continuous  updating  of  hj(t).  In  order  that  this  procedure  be  justified 
on  the  basis  of  gradient  arguments,  it  is  required  that 
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Afii(Vi)*  Ahi(‘j)  <c'4» 

The  first  observation  is  that  in  general  strict  equality  does  not 

hold  in  Eq.  (C -4)  because  ir  is  held  constant  in  Eq.  (C-l)  at  the  value  hj(tj), 

whereas  in  Eqs.  (C-2)and  (C-3)lr(t)  is  continuously  updated.  Because  e(t)  is 

a  functional  of  h(t)  through  Eq.  (4.2-6),  the  integrands  of  the  two  expressions 

differ.  This  fact  can  be  detrimental  to  system  performance  if  Eq.  (C-2) 

adjusts  the  adaptive  gains  in  the  wrong  direction  near  the  beginning  of  the 

interval,  thereby  causing  significant  variations  in  L  (t)  and  eK.(t)  later  in 

e  ui 

the  interval.  It  is  possible  that  the  resulting  closed  loop  adaptive  system 
coulc’  be  made  unstable  by  this  behavior.  This  potential  difficulty  can  be 
avoided  by  updating  h.  only  at  discrete  times  tj,  as  suggested  in  Section 
4.2.2. 


Another  comparison  between  Eqs.  (C-l)  and  (C-3)  is  afforded 

by  the  M.I.T.  procedure  for  computing  eh  (t)  for  t  ^  t.  given  in  Eq.  (4. 2-18), 

i  1 

and  repeated  here  as 

e.  (t)  =  cT  *h  (t) 
ni  “  i 

=  Am\(t)  '  5mxi(t);  Hhi(t0>  =  2  <c-5> 

In  order  that  Eq.  (C-4)  hold  for  successive  integration  intervals,  the  quan¬ 
tity  xh.(t)  should  be  reinitialized  to  zero  in  the  adaptive  controller  at  times 
tj,  j  =0,1,...  This  corresponds  to  beginning  the  integration  in  Eq.  (C-l) 
over  successive  intervals,  with 


However,  in  the  M.I.T.  method  Eq.  (C-5)  is  implemented  continuously 
from  t  =  tg  without  ever  resetting  to  zero.  Presumably  this  is  a 
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matter  of  convenience  to  avoid  the  digital  logic  associated  with  reinitiali¬ 
zation.  Thus  at  first  glance,  the  M.I.T.  adaptation  rule  looks  like  a 
gradient  procedure  only  over  the  first  T  seconds;  for  subsequent  inter¬ 
vals,  the  integral  in  Eq.  (C-3)  might  be  substantially  different  from  that 
in  Eq.  (C-l). 

To  analyze  this  aspect  of  the  M.I.T.  method,  suppose  the 
adaptive  system  is  in  equilibrium  at  time  t  >  t^  with 

h(t)  =  0;  e(t)  =  0 

At  this  instant  let  a  change  in  some  system  parameter  occur  so  that  an 
error  begins  to  develop.  For  t  s  t  >  tg  the  solution  of  Eq.  (C-5)  is 

t 

eh(t)  -  -  cT  (  4>m(t,r)  bm  x.(T)  dr  (C-6) 

1  *0 

where  <J>m(t,  r)  is  the  transition  matrix4  for  the  system  model, 
other  hand,  if  ^.(fc)  is  set  to  zero  at  t^,  then 

t 

e,  (t)+  -  -  (  $  (t, r)b  x.(t)  dT 

h.  '  -  )  m  ’  -m  i 

1 

where  the  superscript  t  means  that  the  differential  equations  for  x.(t)  are 
reinitialized  at  t  =  t  .  The  error  incurred  in  using  the  solution  of  (C-6) 

At 

rather  than  (C-7)  in  the  parameter  adjustment  algorithm  of  Eq.  (C-2)  is 
related  to  the  quantity 

_ 

See  Appendix  A  for  a  definition  of  transition  matrix. 


On  the 
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Ae,  ^  ~  eh  ®  “  eh 
1  ni  i 


c  \  $  (t,  t)  b  x.(t)  dT 

-  J  m  ’  -m  1 


(C-8) 


L0 


Assuming  the  linear  time -invariant  reference  model  is  chosen  to  be 
asymptotically  stable,  the  elements  of  the  transition  matrix  approach 
zero  exponentially  at  a  rate  determined  by  the  dominant  time  constant 
associated  with  model  pole  locations;  i.e„,the  magnitudes  of  the  elements 
of  $>m(t,r)  in  Eq.  (C-8)  regarded  as  functions  of  t  approach  zero  ?s 
ce-er(t“T)  for  some  positive  values  of  c  and  cr.  Consequently  |  Ae^t)  |  as 
given  by  Eq.  (C-8)  is  small  if 

a(t  -r)  »  0;  tQs  t  s  t£  (C-9) 

The  condition  expressed  by  Eq.  (C-9)  holds  for  most  values  of  t  within 
the  gain  adjustment  interval,  t  s  t  s  t  +T,  if 

Xj  Xj 

T  »  rm  (C-10) 

whore  r  is  the  model  settling  time.  If  Eq.  (C-10)  is  satisfied,  the 

HI 

resulting  integrated  effect  of  Ae^(t)  upon  h^  also  tends  to  be  small. 

The  conclusion  is  that  the  analog  adaptive  algorithm  reliably  pro¬ 
duces  the  weighting  function  e^t)  required  in  Eq.(C-l)only  for  an  integra¬ 
tion  interval  which  is  significantly  longer  than  the  model  settling  time. 
Therefore  if  the  gain  a.  is  assigned  a  value  much  greater  than  is  justified  for 
such  a  long  interval,  or  if  the  system  parameters  and  command  input  vary 
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considerably,  the  real-time  parameter  adjustment  rule  expressed  by 
Eq.  (C-2)  does  not  necessarily  correspond  to  a  gradient  method. 


The  object  here  is  to  provide  a  qualitative  measure  of  the  con¬ 
ditions  under  which  an  analogy  between  the  M.  I.T.  method  and  gradient 
techniques  can  be  made.  It  may  be  observed  in  particular  applications 
that  the  adaptive  system  of  Fig.  4.2-1  behaves  satisfactorily  even  when 
the  design  assumptions  do  not  hold;  such  characteristics  are  fortuitous 
but  they  cannot  be  predicted  from  the  theory.  A  by-product  of  this  dis¬ 
cussion  is  recognition  that  by  discretely  updating  the  adaptive  gains  and 

reinitializing  xh  in  Eq.  (C-5)  at  times  t.,  i  =  0,  1, . . . ,  the  analogy  be- 
“i  3 

tween  M.I.  T.  and  gradient  methods  can  be  made  for  any  value  of  T  in 
Eq.  (C-l).  A  short  integration  interval  is  desirable  to  achieve  a  rapid 
convergence  rate  because  many  adjustments  of  the  type  given  in  Eq. 
(C-l)  are  required  to  achieve  the  minimum  of  J  in  Eq.  (4.2-1),  for 
fixed  values  of  the  plant  parameters.  Because  the  increments,  Ah.,  are 
to  be  computed  in  real  time  over  successive  intervals  of  length  T,  suf¬ 
ficient  total  time  is  required  to  provide  enough  steps  for  adequate  con¬ 
vergence.  Assuming  that  the  allowable  change  in  h  which  satisfies  con¬ 
dition  (C-4)  with  adequate  accuracy  is  independent  of  the  length  of  the 
interval  and  assuming  the  cost  function  is  invariant  with  time,  the  con¬ 
vergence  rate  of  the  gradient  technique  is  inversely  proportional  to  T. 
Typically  then,  recalling  the  condition  given  by  Eq.  (4.2-4),  T  should 
be  specified  by 


T 


(C-U) 


provided  that  xj^(t)  is  reset  to  zero  at  the  beginning  of  each  adjustment 
interval.  This  reasoning  leads  to  the  discrete  form  of  the  M.I.T.  method 
described  in  Section  4.2.2. 
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APPENDIX  D 
STABILITY  THEORY 

In  this  appendix  definitions  and  theorems  related  to  stability 
theory  which  are  helpful  in  analyzing  and  designing  adaptive  control  sys¬ 
tems  are  given „  An  extensive  body  of  literature  exists  on  this  subject, 
some  of  which  is  provided  in  Refs.  64-71.  Excellent  basic  treatments 
are  provided  in  Refs.  64  ,  87  ,  68  and  71.  No  attempt  is  made  to  give  a 
complete  survey  of  the  subject  here;  only  material  required  in  this 
report  is  presented. 

D„  1  DEFINITIONS 

We  begin  by  introducing  definitions  of  various  types  of  stability 
which  apply  to  the  asymptotic  behavior  of  solutions  of  a  set  of  time- 
invariant,  first  order  differential  equations  beginning  at  time  tQ ; 

x  =  f(x);  x(t0)  =  x0  (D-l) 

We  are  interested  in  the  properties  of  x(t)  for  t  >  tQ  near  the  equilibrium 
point,  x0,  which  satisfies 

f(xe)  =  0  (D-2) 

By  redefining  the  state  with  a  shift  of  origin, 

x(t)  =  -  z(t) 

one  can  always  make  the  equilibrium  solution  for  z(t)  equal  to  zero. 
Therefore  without  loss  of  generality,  assume  x  -  0. 

v 
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The  solution  to  Eq.  (D-l)  near  the  origin  ^an  be  qualitatively 
described  in  terms  of  the  following  definitions  (Refs.  64  and  67): 


1.  The  origin  is  a  stable  equilibrium  point  of  Eq.  (D-l) 
if  for  every  real  number  6  >  0,  there  exists  a  real 
number,  c(6)  (depending  upon  6),  -such  that  the 
condition 


implies 

|x(t)|  s  6;  t  2  t0 

In  other  words,  x(t)  is  bounded  in  magnitude  by  an 
arbitrarily  small  number  if  the  initial  conditions 
are  sufficiently  small. 

2.  The  origin  is  an  asymptotically  stable  equilibrium 
point  of  Eq.  (D-l)  whenever  it  is  stable  and,  in 
addition, 


lim  1  x(t )  1  ~  0 

t-»w 

More  precisely,  there  is  some  real  number  y  >  0 
such  that  for  every  6  >  0  there  is  also  a  number 
T(6;  xQ)  for  which 

IXqI  *  y 

implies 

|  x(t)  |  s  6  for  all  t  >  t0  +  T 

These  definitions  provide  only  local  descriptions  of  system 
behavior.  That  is,  the  number  c(5)  might  have  a  finite  upper  bound  over 
all  values  of  6.  This  would  mean  that  for  some  value  of  the  initial  state 
whose  magnitude  is  larger  than  the  bound,  the  convergence  properties 
expressed  in  the  definitions  would  not  hold.  In  many  cases  one 
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desires  a  stronger  condition  on  system  behavior  such  as  that  pro¬ 
vided  by: 


3.  The  solution  to  Eq.  (D-l)  is  globally  asymptotically 
stable  if  it  is  stable  (in  the  sense  of  definition  1 

above]  and 


lim  |  x(t)  |  =  0 

t  -»  03 


for  all  xQ. 

Definition  3  guarantees  a  well  behaved  system  no  matter  how  large  is 


The  above  definitions  can  be  applied  to  an  adaptive  missile 
autopilot  having  constant,  possibly  unknown,  plant  parameters  and  a 
constant  input.  Ideally,  global  asymptotic  stability  of  the  equations  of 
motion  (e.g. ,  Eq.  (4.4-15))  for  the  plant  and  adaptive  gains  about  their 
equilibrium  solution  is  desired.  Some  theorems  which  are  helpful  in 
achieving  this  condition  are  provided  in  the  following  sections. 


D.  2  LINEAR  TIME -INVARIANT  SYSTEMS 

Conditions  for  achieving  stability  and  global  asymptotic  stability 
of  solutions  to  the  time -invariant  linear  equations, 

x(t)  =  Ax(t)  (D-3) 

are  readily  stated.  Because  the  solution  to  Eq.  (D-3)  can  be  obtained 
analytically  as  a  linear  combination  of  terms  like 
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where  X^  is  an  eigenvalue  of  A  having  multiplicity*  j^,  a  necessary  and 
sufficient  condition  for  stability  is: 

Re(Xj)  *  0  for  all  i;  if  Re(Xi)  =  0  then  there  exist 
linearly  independent  eigenvectors  --  k  -  1, . . , 
j.  —  which  satisfy 

A^k 

For  global  asymptotic  stability  it  is  necessary  and  sufficient  that 
Re(Xi)  <  0  for  all  values  of  i. 

D.  3  LINEAR  PERIODIC  SYSTEMS  (Floquet  Theory) 

special  type  of  time-varying  linear  system 

x(t)  =  A(t)  x(t)  (D-4) 

which  can  be  treated  in  terms  of  the  definitions  stated  in  Section  D.  1  is 
the  case  when  A(t)  is  periodic  with  period  T, 

A(t  +  T)  =  A(t) 

It  can  be  shown  (Ref.  71)  that  the  transition  matrix**  associated  with  A(t) 
is  of  the  form 

R(t-t0) 

*(t,t0)  =  P(t,t0)  e  (D-5) 

where  P(t,to)  is  periodic  in  T  and  R  is  a  constant  matrix;  eR  is  defined 
in  Eq.  (A-6).  The  solution  x(t)  to  Eq.  (D-4)  at  times  t0+nT  —  n  =  0, 1, . .  — 
is  given  by** 

x(t0  +  nT)  =  $(t0  +  nT,  !<,)  x0  (D-6) 

* 

That  is,  there  are  ij  eigenvalues  of  A  which  have  the  value  Xj. 

See  Section  A.  2. 
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Substitution  of  Fq.  (D-5)  into  Eq.  (D-6)  and  using  the  fact  -- 
P(t0+nT,t>)  =  P(t0,t0)  =  I  —  produces 

x(t0+nT)  =  eRnTx0  =  $(t0 +T,t0)n  xQ 

Thus  the  behavior  of  x(t)  at  integral  periods  can  be  described  by  a  dif¬ 
ference  equation, 

x(t0+nT)  =  *(t0+T,t0)  x(t0  +  (n-l)T)  (D-7) 

where  $  is  independent  of  n.  In  terms  of  the  eigenvalues  Xj  of  $>(t0  +  T,t0), 
each  having  multiplicity  h,  a  necessary  and  sufficient  condition  for 
stability*  of  solutions  to  this  difference  equation  is  (Ref.  68): 

|  X j |  £  1  for  all  i;  if  |  X i i  =1  then  there  exist  jj 
linearly  independent  eigenvectors  £k  k  =  1>  •  •  > 
jj  —  which  satisfy 

^(to+T»0ik  =  \-k 

For  global  asymptotic  stability  it  is  necessary  and  sufficient  that 

I X  i  I  <  1;  all  i 

These  conditions  together  with  Eq.  (D-5)  imply  the  respective  stability 
and  asymptotic  stability  of  x(t)  for  all  time. 

The  above  discussion  demonstrates  that  the  stability  proper¬ 
ties  of  Eq.  (D-4)  depend  upon  the  eigenvalues  of  the  matrix 

$(t0+T,t0)  =  eRT  (D-8) 


Stability  and  asymptotic  stability  of  difference  equations  are  defined  in 
the  same  manner  as  in  Section  D.  1  except  that  t  is  replaced  by  nT. 
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In  general  the  matrix  R  on  the  right  side  of  Eq.  (D-8)  cannot  be  determined 
analytically;  however,  it  may  be  practical  to  obtain  $>(t0  +T,tQ)  by 
numerically  integrating  its  differential  equation 

*(t,t0)  =  A(t)$(t,t0)  ;  «Kt0,t0)  =  I 

over  the  interval  tQ  s  t  s  tQ  +  T,  especially  if  T  is  relatively  small.  In 
this  manner  the  stability  of  periodic  systems  can  often  be  ascertained. 


D.4  NONUNEAR  SYSTEMS  (Linearization) 

When  presented  with  a  nonlinear  system  to  analyze,  as  in 
Eq.  (D-l),  commonly  one's  first  approach  is  to  linearize  the  equations 
about  an  equilibrium  point  and  study  their  behavior  with  higher  order 
terms  neglected.  To  this  end  suppose  Eq.  (D-l)  can  be  written  as 

x(t)  =  j[x(t)]  =  Ax(t)  +  g[x(t)]  (D-9) 


where 


lim 
|  x  |  —  0 


0 


and  A  is  a  constant  matrix.  Typically  A  is  determined  by  calculating 


af(x) 


4 


3x 


x  =  0 


The  behavior  of  x(t)  about  the  origin  can  be  expressed  in  terms 
of  properties  of  the  matrix  A.  Recall  that  local  stability  properties  are 
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expressed  by  Definitions  1  and  2  in  Section  D.  1.  In  particular,  it  has 
been  proved  (Ref.  71)  that  the  solution  to  Eq.  (D-9)  is  (locally) 
asymptotically  stable  if  the  eigenvalues  X  j  of  A  satisfy 

Re(Xj)  <  0  ;  for  all  i 

and  that  the  origin  in  unstable  (i.  e.,  it  is  not  stable  in  the  sense  defined 
in  Section  D.  1)  if 


Re(Xi)  >  0  ;  for  any  i 

In  a  similar  manner  the  Floquet  theory  described  in  Section  D„  3  can  be 
applied  to  determine  local  stability  properties  when  A  is  periodic. 

An  important  observation  is  the  fact  that  nothing  general  can 
be  said  when 


Re(Xi)  *  0  ; 

alii 

Re(Xj)  =  0  ; 

some  j 

For  this  case  tne  effect  of  the  nonlinear  terms  must  be  investigated;  this 
fact  is  effectively  demonstrated  in  Eq.  (4. 2-44).  One  special  case  where 
some  eigenvalues  of  A  have  nonzero  real  parts  is  treated  in  Appendix  E. 


D.  5  NONLINEAR  SYSTEMS  (Liapunov  Theory) 

For  the  nonlinear  system, 

x(t)  =  f[x(t)]  (D-10) 

global  stability  properties  can  often  be  determined  by  methods  associated 
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principally  with  Liapunov  (Refs.  65,  67,  68).  The  relevant  theorems  are 
stated  without  proof  in  this  section. 

Theorem  1  (Ref.  67) 

Let  there  exist  a  scalar  function  V(x)  associated  with 
the  state  of  Eq.  (D-10)  and  having  the  following 
properties : 

(a)  V(x)  and  its  first  order  partial  derivatives  are 
continuous  in  a  certain  open  region  R  about  the 
origin  in  n -dimensional  Euclidean  space. 

(b)  V(x)  >  0;  x  i  0,  x  hi  R 

(c)  V(0)  =  0 

Then  the  following  statements  are  true  with  respect 
to  solutions  x(t)  of  Eq.  (D-10): 

t 

1(a)  If  V(x(t))  s  0  along  solutions  of  Eq.  (D-10)  in 
R,  the  origin  is  stable.  A  V(x)  with  this 
property  and  which  satisfies  conditions  (a), 

(b)  and  (c)  above  is  called  a  Liapunov  function. 

1(b)  If  V(x(t))  <  0  along  solutions  of  Eq.  (D-10)  in 
R,  the  origin  is  asymptotically  stable. 

1(c)  If  1(b)  holds  and  R  is  the  whole  Euclidean 
space,  denoted  by  the  symbol  En,  with 

lim  V(x)  =  », 
jx|-» 

then  the  origin  is  globally  asymptotically 
stable. 

1(d)  If  1(a)  holds,  if  R  =  En,  if  V(x(t))  does  not 
vanish  identically  in  t  *  t0  for  any  to  and 
any  xQ  ^  0,  and  if 

lim  V(x)  =  » 

1*1 — 

then  the  origin  is  globally  asymptotically 
stable. 
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The  details  of  the  above  theorem  can  be  motivated  by  Fig.  D-l  which 
illustrates  possible  trajectories  for  x(t).  The  contours  of  constant  V(x) 
are  closed  in  the  vicinity  of  the  origin.  If  the  contour 

V(x)  =  V(x0) 

is  contained  in  R  as  indicated  and  if  condition  1(a)  of  the  theorem  holds, 
V(x(t))  is  never  increasing  and  the  trajectory,  denoted  by  x  ja(t),  must 
remain  within  the  contour.  On  the  other  hand  if  condition  1(b)  holds, 
V(x(t^)  is  constantly  decreasing  and  the  trajectory,  denoted  by  x^t), 
must  eventually  approach  the  origin.  Similar  arguments  apply  to  the 
other  conditions  of  the  theorem. 

Notice  that  the  conditions  of  the  theorem  require  one  to  find 
an  appropriate  V(x),  i.e.,  a  Liapunov  function;  however  there  is  no 
general  systematic  method  for  doing  this.  This  matter  receives  a  great 
deal  of  attention  in  the  literature;  suitable  functions  have  been  found  for 
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various  special  types  of  nonlinear  systems.  The  failure  to  find  a  form 
of  V(x)  which  has  +he  desired  properties  for  a  specific  case  does  not  mean 
that  one  doesn't  exist.  Consequently  proving  stability  with  Liapunov 
techniques  is  an  ad  hoc  procedure. 

The  next  theorem  treats  the  case  where  V(x(t))  =  0  along  some 
solution  to  Eq.  (D-10). 

Theorem  2:  (Ref.  65) 

Let  there  be  a  scalar  function  V(x)  satisfying  conditions 

(a)  and  (b)  of  Theorem  1  with  R  =  En  and  V(x(t))  s  0 

along  solutions  of  Eq.  (D-1C)  and 

lim  V(x)  =  » 

|x|— 

Let  T  be  the  set  of  values  of  x  such  that 
V(x(t))  =  0;  x  c  T 

and  let  S  be  the  largest  subset  of  T  such  that 
x(t0)  in  S=^>  x(t)  in  S;  t  * 

Then  all  solutions  converge  to  S  af?  t  -  °°. 

Theorem  2  says  that  under  suitable  conditions,  x(t)  must 
approach  a  set  of  values  S  such  that  V(x)  =  0  for  all  x  contained  in  the 
set.  This  result  is  particularly  useful  when  Eq.  (D-10)  does  not  have 
a  unique  equilibrium  point.  In  such  a  situation,  S  may  be  the  set  of  ail 
solutions  to  the  equation 

f(x)  =  0 

and  one  can  establish  the  convergence  of  x(t)  to  S. 
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All  of  our  discussion  thus  far  has  been  related  to  time- 
invariant  systems.  Some  remarks  about  the  time -varying  case  are  given 
in  the  next  section. 


D.  6  TIME -VARYING  SYSTEMS 


The  equations  under  consideration  in  this  section  are  of  the 

form 

x(t)  =  f[x(t),  t]  ;  x(t0)  =  xQ 

f(o,  t)  =  o  (D-ii) 

Most  of  the  definitions  and  theorems  of  the  preceding  sections  can  be 
modified  to  give  general  stability  conditions  for  solutions  to  Eq.  (D-ll); 
the  essential  difference  is  that  conditions  which  are  uniform  with  time 
must  be  imposed.  One  useful  result  applicable  to  Eq.  (D-ll)  is  a  simple 
extension  of  the  theorems  in  Section  D.  5  given  by  the  following: 


Theorem  3 

Suppose  we  are  given  a  function  Vj/x)  which  satisfies 
the  conditions  on  V(x)  in  respectively  Theorems  1  and/or 
2  of  Section  D.5  and  also  a  continuous  function  V2(x) 
satisfying  both 


V2(0)  =  o 

and  the  set  of  conditions  on  V(x)  in  Theorems  1  and/or  2 
respectively.  Then  if 

V^x^t)  s.  V2(x(t)) 

along  solutions  of  Eq.  (D-ll),  the  solution  of  Eq.  (D-ll) 
has  the  stability  properties  described  in  Theorems  1 
and/or  2  respectively. 
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This  can  be  viewed  as  a  special  case  of  the  general  theorems  for  time- 
varying  systems;  that  is,  Vj(x)  has  all  the  properties  required  of  a 
time-varying  Liapunov  function  for  Eq.  (D-ll)  (Refs.  65,  67). 

Thus  far  conditions  have  been  given  for  convergence  of  x(t)  to 
a  set  where  the  Liapunov  function  satisfies  the  condition 

V(x(t)  =  0 

It  has  been  assumed  that  motion  is  restricted  to  a  region  where 

V(x(t»  s  0  (D-12) 

However,  acceptable  operation  can  sometimes  be  attained  even  though 
V(x(t))  ^  0  inside  a  small  region  about  an  equilibrium  point;  one  might 
expect  convergence  of  x(t)  to  some  type  of  oscillatory  condition  within 
the  region.  Limit  cycling,  steady  state  solutions  achieved  in  some  non¬ 
linear  control  systems  are  examples  of  this  type  of  behavior.  These 
considerations  motivate  a  discussion  of  a  somewhat  different  type  of 
stability  than  that  defined  in  Section  D.  1. 

D.7  PRACTICAL  STABILITY 

Practical  stability  deals  with  the  behavior  of  the  solution  x(t) 
to  a  set  of  differential  equations 

x(t)  =  f  [x(t),  t]  (D-13) 

when  a  positive  scalar  function  V(x)  exists  such  that  V[x(t),t]  does  not 
satisfy  the  condition  in  Eq.  (D-12)  everywhere  in  the  vicinity  of  the 
equilibrium  point.  In  this  case  a  different  type  of  stability  from  that 
associated  with  Liapunov  theory  can  sometimes  be  established  (Refs.  65 
and  67).  A  useful  result  of  this  type  is  the  following: 
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Theorem  4 


Let  a  function  V^(x)  be  given  vhich  satisfies  the 
following: 

(a)  V-^x)  and  its  first  partial  derivatives  are 
continuous  in  En. 

(b)  VjCx)  >0;  x  /  0 

(c)  lim  V(x)  =  ® 

Also  let  there  be  a  continuous  function  V2(x) 
satisfying: 

(d)  V2(x)  >  0;  x  t  0 

(e)  V2(0)  =  0 

Now  define  a  set  of  f!  by  the  condition 
V2(x)  s;  M  <£>  x  in  S 

Then  if 

Vt(x(t),t)  <-V2(x(t))+M 

along  solutions  of  Eq.  (D-13),  it  follows  that  x(t)-T 
as  t-%  where  T  is  a  set  defined  by 

V^x)  s:  L  <£#>  x  e  T 

L  =  Max  (v,  (x)} 
x  in  S  1  1  ; 

The  content  of  the  theorem  is  illustrated  by  Fig.  D-2.  Any  solution  x^(t) 
wliich  starts  outside  T  must  converge  to  T  since 

<  0;  for  all  x  outside  T 
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Figure  D-2  Illustration  of  Motion  that 
Exhibits  Practical  Stability 

Furthermore,  any  solution  Xg(t)  which  begins  within  T  must  remain  in 
T  since 


Vj  <  0;  x  in  the  boundary  of  T 

This  concludes  our  summary  of  several  methods  for  deter¬ 
mining  stability  properties  of  differential  equations.  These  techniques 
are  important  for  achieving  satisfactory  designs  for  adaptive  controllers 
because  they  provide  a  qualitative  description  of  the  behavior  of  non¬ 
linear  and  linear  time-varying  systems , 
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APPENDIX  E 

A  LOCAL  STABILITY  THEOREM  FOR  ADAPTIVE  SYSTEMS 


In  this  appendix,  conditions  are  established  for  local  stability  of 
a  nonlinear  system  having  equations  of  motion  similar  to  Eq.  (4.2-40).  The 
need  for  this  arises  because  conventional  stability  theorems  cannot  be 
applied  to  a  linearized  system  when  it  has  some  closed  loop  poles  at  the 
origin  of  the  complex  plane. 


Consider  the  nonlinear,  time -invariant  system  described  by 

x(t)  =  Ax(t)  +  f [x(t)j  ;  x(0)  =  xq  (E-l) 

with  the  partitioned  form 


A11  !  A12 
i 

’x^t)' 

fjtjlft),  X2<t)] 

1 

> 

M 

I-* 

"o' 

i _ 

_x2(t)_ 

I 

-f-2tl(t)] 

where  x^t)  is  an  n-dimensional  vector,  Xg(t)  an  m-dimensional  vector, 
A^  an  n  xn  matrix,  A^2  an  n  x  m  matrix,  and  an  m  xn  matrix.  Let 
the  total  dimension  of  x(t)  be  &  =  n  +  m.  Equation  (E-2)  has  a  special 
uncoupled  nature  in  that  Xg(t)  is  driven  by  only  x^(t).  It  has  the  same 
structure  as  Eq.  (4.2-40)  without  the  forcing  terms;  the  effects  of  the 
latter  will  be  investigated  subsequently. 

Our  purpose  is  to  establish  conditions  for  which  stability  of  the 

equations 

z(t)  =  Xz(  t) 
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implies  local  stability  of  Eq.  (E-l);  in  particular  we  are  interested  in  the 
case  where  A  has  some  eigenvalues  at  the  origin  of  the  complex  pl&ue. 

The  results  of  this  investigation  are  summarized  in  the  following  theorem: 

Theorem 


Suppose  the  system  described  by  Eq.  (E-2)  satisfies 
the  conditions: 

(a)  The  eigenvalues  si?  i  =  1, 2  ox  A  satisfy* 

Re^)  <  0  ;  i  =  1, . r 

s.  =  0  ;  i  =  r  +  1, .  .,  t, 

That  is  to  say,  the  nonzero  eigenvalues  of  A 
have  negative  real  parts  and  there  are 
l  -  r  -  q  eigenvalues  with  the  value  zero. 

(b)  Rank  (A12)  =  m-qi  1 

(c)  If  |x2 1  =  a ,  then  there  exists  a  y  >  0  such  that 


Then  it  follows  that  the  solution  to  Eq.  (E-l)  is  locally 
stable,  and  furthermore  locally  we  have 

lim  Jx^(t)|  =  0  (E-3) 

t  -*co 


* 

Re(s)  denotes  the  real  part  of  the  complex  number  s. 


t 

-wife 
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The  proof  of  this  theorem  is  similar  to  that  given  by  Coddington 
and  Levinson  (Ref.  71)  for  the  case  when  the  eigenvalues  of  A  are  strictly 
in  the  left-half  complex  plane.  The  hypotheses  of  the  theorem  are  moti¬ 
vated  by  the  various  steps  required  to  verify  Eq.  (E-3).  It  is  clear  that 
in  general  Eq.  (E-3)  must  hold  for  stability  because  the  state  Xg(t)  in 
Eq.  (E-2)  is  given  by 


t 

3^(t)  =  Xg(0)  +  f  [-^21  -i (T )  +-2(-l ))  dr  (E-4) 

0  L 


If  Xj(t)  does  not  approach  zero  sufficiently  rapidly,  x^(t)  will  tend  to 
become  arbitrarily  large  for  some  functions  f0(x  ■. ).  The  main  task  is 


to  demonstrate  that  the  zero  eigenvalues  of  A  do  not  contribute  to  the 
solution  for  x^(t). 


From  Appendix  A  we  know  the  solution  to  Eq.  (E-l)  satisfies 


t 

x(t)  =  $(t,0)x(0)+J  $(t,T)  1[x(t)]  dr  (E-5) 

0 

where  <C»(t2, tj)  is  the  transition  matrix  associated  with  A.  Write  $>(t,  0)  in 
partitioned  form 


*M) 


®  l 

•V 

-M 

h  1 

«•  i 

L.. . . 

®i2(t,°) ' 

>(t’0)  i 

•0*  1 
tSJ  1 

1 

S+  1 

3  i 

'  i 

t - 

(E-6) 


with  the  same  partition  dimensions  as  A  and  expand  Eq.  (E-5)  to  obtain  an 
an  expression  for  x  j(t), 


E-3 
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Xjft)  =  4»u(t,  0)^(0) +  4>12(t,0)x2(0) 


t 

+  j  1*11^’ T^-l  |^-l *-2(r)J  +  ^12(t,T)-2^-l(T|]|  ** 


(E-7) 


From  Eq.  (E-7)  it  is  clear  that  in  order  for  Eq.  (E-3)  to  be  satisfied,  it 
should  be  true  that 


lim  !$n(t,0)|  =  Urn  |*12(t,0)|  =  0  (E-8) 

t  -*  ®  t  *♦  °° 

where  1 $>|  is  the  matrix  norm.*  Otherwise  the  contributions  of  the  initial 
conditions  to  x^(t)  may  not  approach  zero.  Eq.  (E-8)  is  verified  below 
by  examining  the  detailed  construction  of  $(t,0)  under  the  hypotheses  of 
the  theorem. 

The  Laplace  transform  $(s)  of  $(t,0)  is  given  in  Section  A.  2  as 

*(s)  =  (Is -A)’1  (E-9) 

Denote  Fy(s)  as  the  element  in  the  i01  row  and  jth  column  of  $(s)  and 
define  the  cofactor  (Ref.  145)  associated  with  the  i-j^1  element  of  the 
matrix  (Is  -  A)  as  p^(s).  Recall  that 

Pjj(s)  -  (-l)i+i  Det  Uytslj  (E-10) 


3^ 

The  appropriate  definition  of  a  matrix  norm  |  A|  for  this  application  is 
(the  maximum  eigenvalue  of  ATA)1/2.  This  definition  is  "compatible" 
(Ref.  144)  with  the  euclidpan  norm  of  a  vector  |xj  =(xTx)i/2,  in  the 
sense  that 


Ax]  s  | A]  |x 


E-4 
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where  M„(s)  is  the  minor  (submatrix)  obtained  by  deleting  the  ith  row  and 
jth  column  of  (Is  -2).  From  the  definition  of  the  inverse  of  a  matrix 
(Ref.  145)  it  follows  that 

p..(s) 

F„(s) - ^ - —  {E  -11) 

lj  Det  (Is  -  A) 

Because  A  has  q  eigenvalues  at  the  origin  (hypothesis  (a)) 


Det  (Is  -  X)  =  sq  p(s) 


(E-12) 


where  p(s)  is  a  polynomial  with  strictly  left -half  plane  zeros  (i.  e. ,  none 
have  zero  real  parts).  Conditions  (a)  and  (b)  of  the  theorem  imply  that  the 
first  n  columns  of  A  are  linearly  independent,  both  of  each  other  and  of 
the  last  m  columns  in  A,  and  also  the  rank  of  A  is  equal  to  r.  Therefore, 


Rank 


(V0))  “ 


r  - 1; 


i  =  l, .  .  .a 


j  = 


(E-13) 


Consequently,  because  the  dimension  of  M..(s)  is  £-1, 


Det  (m. . j  -  scq..(s)  ; 


(i  =  1 


1,  •  •  ,  X 


j  =  1, . .  ,n 


c  s  (jt  -  1)  -  (r  -  1)  =  £  -  r  =  q 


(E -14) 


where  q..(s)  is  a  polynomial.  Substitution  of  Eqs.  (E-12)  and  (E-14)  into 
J 

Eq.  (E-ll)  produces 


F..(s)  =  (-l)1+js^c_q^ 
i r  p(s) 


i  =  1,  ...n 


j  1 » •  •  >  f> 


(E-15) 
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where  c-qaO.  Referring  to  Eqs.  (E-6)  and  (E-9),  each  element  of 

$  (s)  and  4>10(s)  is  given  by  F..(s)  in  Eq.  (E-15)  which  has  its  poles  — 
li  la  ij 

i.e.  ,the  zeros  of  p(s)  —  strictly  in  the  left -half  complex  plane.  This 
implies  that  the  elements  of  $^(t,0)  and  4>jg(t,0)  decay  exponentially  and 
therefore  the  associated  matrix  norms  have  exponential  bounds  (Ref.  34), 

I*um>| s  kie"trt) 

a  >  0  (E-16) 

|*12(t>0)|  5  *2e~0t  ) 

where  and  kg  are  some  positive  constants.  This  establishes  Eq.  (E-8); 
now  we  can  proceed  directly  to  the  proof  of  the  theorem,  following  Ref.  71. 

For  convenience  take  k^  =  kg  =  k  in  Eq.  (E-16).  Taking  norms 
of  both  sides  of  Eq.  (E-7)  and  substituting  Eq.  (E-16),  one  obtains 

|xx(t)|  s  ke-crt^|xj(0)|  +  1^2(0) | ) 

+  k  ^  e"Cr^t’T)(|i1(x1(T),X2(r))  l+llg^x^T))  dr  (E-17) 
0 

Hypotheses  (c)  and  (d)  of  the  theorem  imply  that  for  any  e  >  0,  there 
exists  an  a  and  a  6  such  that  if 


then 


(E -18) 


(E-19) 


E-6 
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The  first  condition  in  Eq.  (E-19)  follows  from  condition  (c)  of  the  theorem; 
in  particular  take 


a  ^ 


€ 

2ky 


(E-20) 


The  second  condition  in  Eq.  (E-19)  follows  from  condition  (d)  of  the 

theorem;  ;.e. ,  f9[x.  ]  -*  0  faster  than  x1 .  Substitution  of  Eq.  (E-19)  into 
—1  ^  —1 

Eq.  (E-17)  and  transposing  e"CT  produces 


t 

e°t|x1(t)|  *  k  (|-i(°)|  +  |-2(0)|)+  €  ^  e<=rX  |x1<t)|  dT  (E -21) 

0 


subject  to  Eq.  (E-18).  This  inequality  yields  (Ref.  71,  p315) 
\xx(t)\  s  k  ^1^(0) | +  |  x2(0)|  je-(CT-€)t 


Now  if  we  require 


e  <  cr 


|Xi(0)|  +  |x2(0)|  S  £ 

Eq.  (E-22)  becomes 

Ix^t)]  s  6e"^"e^  s  6 


(E-22) 


(E-23) 


(E-24) 


The  conclusion  is  that  for  sufficiently  small  initial  conditions  on 
the  state,  x^(t)  is  indeed  less  than  or  equal  to  6  as  required  by  Eq.  (E-18), 
but  still  subject  to  the  requirement, 

jx2(t)|  *  a 
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To  check  the  latter  condition,  we  calculate  a  bound  on  Xg(t).  Substitution 
into  Eq.  (E-4)  from  Eqs.  (E -18),  (E -19)  and  (E-24)  produces 


00 

x2(t)|  *  |x2(°>|+(|A2xl  +  4)  6  i  e'(,I'f)tdt  <E-2f) 


which  reduces  to 


x2(t)|  *  |x2(0)|+(|A21|+  2]c) 


(E-26) 


Given  the  quantities  a,  c,k,  |  |,  and  a  as  specified  by  Eqs.  (E-20), 

(E-23),  (E-16),  (E-2),  and  (E-16)  respectively,  one  can  choose  6  and 
1^2(0) I  sufficiently  small  so  that  by  Eq.  (E-26) 

|x2(t)|  *  a;  t  *  0 

Therefore  the  assumptions  in  Eq.  (E-18)  made  in  deriving  Eq.  (E-24)  are 
consistent  and  x^(t)  -  0  exponentially,  thus  proving  the  theorem. 

To  apply  the  theorem  to  Eq.  (4. 2-40)  neglecting  the  forcing 
terms,  make  the  identifications 


Xj(t)  =  6x(t);  x2(t)  =  6h(t) 

All  =  A'-!;o;  A12  =  '--m 

o  ■  o 

T  0  0i  •  • 

*21  =  ^'  B'=  •  •  •  • 


1 1  (*!<*).  *2<t))  =  -b6h(t)T  6x(t) 
J2(x2(t))  =  B  '  6  x(t)  cT  6x(t) 


(E-2  7) 


E-8 
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The  conditions  of  the  theorem  are  checked  as  follows: 


(a)  The  eigenvalues  of  A  are  determined  by  expanding 
Det  (Is  -  A).  Using  the  Schur  identities  for  the 
determinant  of  a  partitioned  matrix  (Ref.  146)  it 
follows  that 

Det(Is  -  A)  =  snDet(ls  -  An  -  |  A^J 

Substitution  for  A^i ,  Aj2>  and  A21  with  some 
algebraic  manipulation  produces 

Det(Is  -  A)  =  sn-1Det  (is  -  An)T(s) 

T(s)  =  s  +  ke  Gq(s) 


G0(s)  =  cT(ls-An)"1b 

where  k€  and  G0(s)  are  defined  as  in  Eqs.  (4.2-47) 
and  (4. 2-50).  Note  that  the  zeros  of  Det(Is  -  Ajj) 
are  cancelled  by  the  poles  of  T(s).  Now  we  assume 
that  the  adaptation  gains  (3/ and  the  quantity  n0  in 
Aj  1  can  be  chosen  so  that  T(s)  has  n  + 1  zeros 
located  strictly  in  the  left  half  complex  plane,  as 
discussed  in  Section  4.2.6.  Therefore 


r  =  n  +  1 
q  =  n  -  1 

(b)  From  the  structure  of  A12  m  Eq.  (E-27)  it  follows 
that 


Rank(Ai2)  =  1 

Using  the  results  from  (a)  above, 


m-q  =  n-(n-l)=  1  =  RankfA-^) 
Therefore  condition  (b)  is  satisfied. 
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(c)  Condition  (c)  is  satisfied  because 

|ll(sr52) |  s  H  l*ll  |— 2|  s  “y  |5jj| 

where  we  identify 

|*2(t)l  5  “ 

r  =  |b| 

(d)  Condition  (d)  of  the  theorem  is  satisfied  because 
J^Xj)  is  quadratic  in  Xj 

Therefore  Eq.  (4.2-40)  is  locally  stable  in  the  sense  defined  in  the  theorem, 
subject  to  the  condition  that  the  zeros  of  T(s)  have  negative  real  parts. 


The  above  development  provides  sufficient  conditions  on  the 
matrix  A  and  the  nonlinearities  f^  and  fg  hi  order  that  the  solution  to 
Eq.  (4. 2-40)  be  locally  stable,  neglecting  forcing  terms.  To  assess  the 
effect  of  the  latter,  modify  Eq.  (E-2)  as  follows 

x(t)  =  Kx{t)  +  d  +  _f(x(t))  (E-28) 

where  d  is  a  constant  forcing  term  added  to  account  for  the  corresponding 
quantity  in  Eq.  (4.2-40). 


Assume  Eq.  (E-28)  has  a  steady  state  solution  x  which  satisfies 

s 


AXg  +  d  +  _f(Xg)  =  0 

Define  a  new  variable  z(t)  by 

z(t)  =  xftJ-Xg  = 


(E-29) 


(E-30) 


l 

l 

I 

! 

I 

! 

? 

I 

! 

f 

I 

I 

1 

J 

1 

I 

I 

J 
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wnere  the  partitions  correspond  to  those  for  x(t)  in  Eq.  (E-2).  Substitution 
from  Eqs.  (E -29} and {E -30) into  Eq. (E-28)and  expandingthe  nonlinear  term 
about  x  _  produces 

—  b 


z(t)  =  Az(t)  + 


M 

dx 


z(t)  +  higher  order  terms 


(E -31) 


5s 


where  df/ax  is  the  matrix  with  its  i-jth  element  equal  to  afj/axj.  Now 
define  the  partitioned  matrix 


ax 


r«i 

SXj 

av 

£ 

1 

F 

*12 

u* 

II 

^2 

*_k 

> 

F 

22 

axj 

a-2j 

—s 

(E-32) 


The  result  of  combining  terms  in  Eq.  (E-31)  and  using  Eq.  (E-32)  is 

z(t)  =  (A  +  F)  z(t)  +  higher  order  terms  (E-3S) 

Using  Eq.  (E-27)  we  can  evaluate  the  partial  derivatives  in  Eq.  (E-32)  as 
follows: 


fix  1  + 
— s 


T\ 

5es£  )’ 


=  0 


The  higher  order  terms  in  Eq.  (E-33)  are  quadratic  functions  similar  toj^ 
and  ^  Eq.  (E-2).  Theiefore  from  the  above  relations  it  follows  that  the 


E-il 


THE  ANALYTIC  SCIENCES  CORPORATION 


unforced  nonlinear  equations  of  motion  for  z(t)  have  the  same  structure 
as  Eq.  (E-2)  and  the  stability  theorem  can  be  applied  to  the  matrix 
(A  +  F). 

This  concludes  the  investigation  of  conditions  for  the  stability 
of  Eq.  (E-2).  The  results  obtained  enable  one  to  make  some  qualitative 
statements  about  the  behavior  of  Eq.  (4.2-40)  as  described  in  Section 
4.2.6. 
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APPENDIX  F 

THE  STEADY-STATE  MATRIX  RICCATI  EQUATION 

In  determining  the  optimal  control  u(t)  for  a  linear  time  invariant 
system  which  minimizes  a  quadratic  performance  index 

*f 

J  =  \  fx(t)T  Qx(t)  +  u(t)T  Ru(t)  dt  (F-l) 

V  "  “  J 

subject  to  the  equations  of  motion, 

x(t)  =  Ax(t)  +  Bu(t);  x(t0)  =  3^  (F-2) 

the  matrix  Riccati  equation 

S(t)  =  -  S(t)  A  -  ATS(t)  +  S(t)  BR_1  BTS(t)  -  Q 

S(tf)  =  0  (F-3) 

has  an  important  role.*  Its  solution  enables  one  to  mechanize  the  optimal 
linear  feedback  control  law 

u(t)  =  -  R”1  BTS(t)  x(t)  (F-4) 

An  important  special  cjise  of  the  above  is  when  the  terminal  time  is  infinite, 
tf  =  «>,  so  that  S(t'  becomes  a  constant  S  which  is  the  solution  to  the  steady 
state  equation 

SA  +  ATS  -  SBR-1  BTS  +  Q  =  0  (F-5) 

See  Appendix  B 
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This  is  an  important  result  because  the  linear  control  law  requires  only 
constant  feedback  gains;  therefore  techniques  for  determining  S  are  of 
interest.  The  latter  are  the  subject  of  this  appendix.  Our  emphasis  will 
be  upon  methods  that  are  feasible  for  solving  Eq.  (F-5)  on-line  in  an  adap¬ 
tive  control  system  as  described  in  Section  5.4.  To  guarantee  that  there 
is  a  unique  positive  semidefinite  solution  for  S,  it  is  assumed  throughout 
this  discussion  that  Q  is  positive  semidefinite,  R  is  positive  definite, 
the  system  represented  by  Eq.  (F-2)  is  controllable,*  and  all  the  eigen¬ 
values  of  A  have  nonnegative  real  parts.  If  the  last  of  these  conditions 
does  not  hold,  Q  must  be  further  restricted,  as  discussed  in  Ref.  147. 

Because  Eq.  (F-5)  is  a  nonlinear  matrix  equation,  it  usually  has 
no  closed  form  solution  and  consequently  some  numerical  technique  must 
be  used  to  obtain  a  close  approximation  to  S.  Three  methods  described  in 
the  literature  are 

•  Numerical  Integration 

•  Newton’s  Method 

•  Eigenvector  Method 

The  essential  features  of  each  are  summarized  ana  then  comparisons  of 
their  relative  computational  complexity  are  made. 

Numerical  Integration  —  Perhaps  the  most  obvious  technique  for 
solving  Eq.(F-5)  is  to  integrate  Eq.(F-3)  to  a  nearly  steady  state  condition. 
It  has  been  established  (Ref.  147)  that  beginning  with  any  positive  semi¬ 
definite  value  of  S(tj)  and  integrating  Eq.  (F-3)  backwards  in  time,  the 
solution  for  S(t)  satisfies 


* 

See  Appendix  A  for  a  definition  of  controllability 
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lim  S(t)  =  S, 
t  -*  -00 

converging  exponentially.  That  is, 

|  S  -  S(t)  |  *  ae"CTt  (F-6) 

for  some  positive  constants,  a  and  o,  where  |  |  denotes  the  matrix  norm. 
The  significance  of  Eq.  (F-6)  is  that  over  an  interval  of  time  having 
length  A,  the  bound  on  the  error  in  the  solution  is  reduced  by  the 
factor  e-a^ 

max  iS-S(nA)|  £  e~ffA  max  |S-S((n-l) A)|  (F-7) 

In  the  terminology  of  numerical  analysis  (Ref.  148)  the  method  has  first 
order  convergence  properties. 

Newton's  Method  —  Newton's  method  is  an  important  recursive 
technique  for  solving  nonlinear  algebraic  equations  (Ref.  148).  Several 
authors  have  described  its  use  for  the  matrix  Riccati  equation 
(e.g. ,  Refs.  147  and  149)  in  Eq.  (F-5).  Applied  to  this  problem  it  results 
in  the  expression 

(a  -BR-1  b\J  \  +  ,  +Sk  +  ,(a  -  BR*1  b\)+  SpilL1  b\  +  Q  ■  0 

(F-8) 

Given  a  value  for  S^,  Eq.  (F-8)  is  a  solvable  linear  relation  in  Sfc+  ^  under 

^  is  symmetric,  the  total 

number  of  unknowns  to  be  determined  is 

n(n  +1) 

2 

If  the  recursion  begins  with  any  positive  semidefinite  matrix  SQ  such  that 
the  eigenvalues  of  the  matrix 


our  assumptions  on  Q,  R,  and  A.  Because  S, 
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-1  T 

A  -  BR  B  S0 


all  have  negative  real  parts,  then  the  sequence  {Sk}  generated  by  Eq.  (F-8) 
converges  to  the  solution  of  Eq.  (F-5)  (Ref.  147). 


With  respect  to  the  convergence  rate  of  this  technique,  it  is 
known  (Ref.  148)  that  as  Sfc  approaches  the  solution  S,  the  error  behaves  as 

|s-sk|  *  /  |s-sk.i|  (F-8) 


where  rj  is  a  positive  constant  less  than  one.  Compared  with  Eq.  (F-7) 

Eq.  (F-8)  indicates  a  much  more  rapid  convergence  rate  as  Sk  approaches  S 
because  the  factor  rp*  decreases  rapidly  with  k  whereas  e  aA  is  constant. 
Therefore  Newton's  method  exhibits  second  order  convergence  properties. 


Eigenvector  Method  -  The  third  technique  is  found  to  be  most 
efficient  when  the  dimension  of  S  is  large.  It  is  based  upon  the  fact  that 
S(t)  in  Eq.  (F-3)  can  also  be  determined  from  the  relations  (Ref.  97) 


•  -1  T 

^  i -BR  S 
» 


-Q  i  -A 


0(t) 


fl{0) 

n(T,t) 

S(t,T) 


=  I 

■ou(T,t)  ;  n12cr,t)  • 

VT.t)  ]  °22<T’t) 

=  -  022(T't>'1  °21(T’t) 


(F-9) 
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where  £1  is  a  2n  x  2n  matrix  (A  being  of  dimension  n),  O^T,!)  -- 
i,  j,  =  1, 2  —  is  an  n  x  n  matrix  and  S  is  regarded  as  both  a  function  of  t 
and  T.  The  solution  to  Eq.  (F-5)  is  obtained  from 


S  =  lim  S(T,t) 
T  -*  ® 


lim 

T-a> 


!-«22(T>trl°21(T’t) 


(F-10) 


or  alternatively 


S 


lim 
t  -*  -® 


-°22(T-t) 


-1 


‘WT’t) 


(F-ll) 


Because  the  differential  equation  for  0(t)  is  time  independent,  we  can  take 
T  =  0  in  Eq.  (F-ll)  so  that  S  can  be  determined  by 


S  = 


lim 
t  -*  -00 


(F-12) 


The  details  of  the  method  involve  determining  the  eigenvalues 
arid  eigenvectors  of  the  matrix 


•  -IT 
A  ,  -BR  B 


Q  i  -A 


which  permit  the  limit  in  Eq.  (F-12)  to  be  easily  calculated.  Several 
algebraic  steps  must  be  completed  and  they  are  described  in  Refs.  150  and 
151.  For  large  values  of  n  (n  =  the  dimension  of  S),  less  computation  time 
is  required  for  this  procedure  than  for  Newton’s  method.  Newton's  method 
requires  inversion  of  a  matrix  whose  dimension  is  (l,/2)(n)(n  -•-1)  and  the 
associated  computation  time  is  proportional  to  n®;  furthermore  when  n  is 
large  or  if  the  matrix  to  be  inverted  is  ill-conditioned  (Ref.  144), 
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excessive  numerical  errors  may  be  incurred.  By  contrast,  the  computation 
time  required  for  the  eigenvalue  method  is  proportional  to  n^„ 

For  tactical  missile  applications,  Newton’s  method  seems  to  be 
a  feasible  procedure  for  solving  Eq.  (F-5).  The  order  of  the  system  is 
sufficiently  low  so  that  the  required  matrix  inversion  can  be  accurately 
accomplished  and  the  associated  programming  instructions  are  less  com¬ 
plicated  than  those  required  by  the  eigenvalue  method.  The  total  compu¬ 
tation  time  required  to  obtain  a  solution  in  the  cases  considered  in  Chapter  9 
is  roughly  l/lOO^1  of  that  required  by  the  numerical  integration  technique. 
The  integration  technique  has  also  been  compared  with  the  eigenvector 
method  (Ref.  151);  for  the  cases  reported  the  former  typically  requires  a 
computation  time  100  times  longer  than  the  latter  to  obtain  a  solution  for  S. 
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APPENDIX  G 

ADJOINT  THEORY  FOR  LINEAR  SYSTEMS 


When  designing  a  control  law  u(t)  for  any  known  system  of 

equations, 


x(t)  =  f(x(t),  u(t)) 


it  is  desirable  to  be  able  to  analyze  the  system  performance  for  various 
choices  of  u(t)„  This  can  always  be  done  numerically  simply  by  integrat¬ 
ing  the  equations  of  motion.  Different  sets  of  initial  conditions  and  types 
of  measurement  errors  can  be  tried  to  obtain  representative  simulations 
of  actual  system  operation.  In  general,  a  great  deal  of  computational 
labor  is  involved  in  examining  a  satisfactory  number  of  cases  to  obtain  a 
measure  of  "average"  system  behavior.  However,  if  the  equations  of 
motion  are  linear,  the  task  of  performance  evaluation  can  be  greatly  sim¬ 
plified  by  the  use  of  sensitivity  functions  which  analytically  determine  the 
effects  of  initial  conditions  and  measurement  errors.  These  functions  are 
derived  here  using  so-called  "adjoint  theory"  for  linear  systems. 


G.l  PROBLEM  FORMULATION 

Consider  the  linear  dynamical  system 

x{t)  =  A(t)x(t)  +  B(t)u(t):  x(t0)  =  xq  (G-T) 

where  u(t)  is  a  linear  feedback  controller  given  by 

u(t)  =  K(t)  x(t)  (G-2) 
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and  K(t)  is  a  specified  matrix.  Define  three  performance  measures  for 
the  system  as  follows: 

Jf  =  x(tf)T  F  x(t|) 

(G-3) 

tf 

J  =  (  x(t)T  Q(t)  x(t)  dt 
-  *o” 

(G-4) 

tf 

Ju  =  u(t)T  R(t)  u(t)  dt 

(G-5) 

"O 


where  F,  Q(t),  and  R(t)  are  assumed  to  be  symmetric,  positive  semi- 
definite  matrices.  The  quantity  Jf  is  a  measure  of  the  magnitude  of  the 
terminal  state  at  time  tf,  Jx  indicates  the  size  of  the  state  along  a  solu¬ 
tion  to  Eq.  (G-l)  and  Ju  represents  the  amount  of  control  ’'effort”  expended. 
These  are  all  familiar  quadratic  performance  indices  used  in  optimal  con- 
trol  theory.  Our  purpose  here  is  to  indicate  how  Jf,  Jx,  and  Ju  can  be 
evaluated,  given  any  linear  feedback  control  law,  in  terms  of  initial  con¬ 
ditions  x(t0)  on  the  state  and  measurement  errors  incurred  in  implement¬ 
ing  Eq.  (G-2). 


G.2  TERMINAL  STATE  SENSITIVITY  TO  INITIAL  CONDITIONS 

To  determine  the  sensitivity  of  the  terminal  state  to  initial  con¬ 
ditions,  rewrite  Eqs.  (G-l)  and  (G-2)  as 

x(t)  =  A(t)  x(t);  x(t0)  =  x0 

A(t)  =  A(t)  +  B(t)  K(t)  (G-6) 

* 

See  Appendix  B. 


G-2 


THE  ANALYTIC  SCIENCES  CORPORATION 


We  know  from  linear  system  theory  that  the  transition  matrix  $>(t,t0) 
associated  with  A(t)  satisfies 

$(t,t0)  =  A(t)  $(t,t0);  $(t0,to)  =  I  (G-7) 

Knowledge  of  $  obtained  by  integrating  Eq.  (G~7)  permits  us  to  write 

x(t)  =  $(t,t0)xQ 

That  is  x(t)  can  be  determined  for  any  values  of  t  and  xQ,  given  the  solution 
to  Eq.  (G-7)  for  a  particular  initial  time  t0.  Consequently  we  can  regard 
the  state  as  a  function  of  both  tQ  and  t,  written  as  x(t,tQ).  However,  in 
many  applications  it  is  desirable  to  know  the  terminal  state  resulting  from 
initial  conditions  imposed  on  the  system  at  different  initial  times.  For 
example,  this  is  the  case  in  the  missile  guidance  problem  discussed  in 
Chapter  11  where  the  effects  of  launching  the  missile  at  different  ranges 
relative  to  the  target  are  to  be  evaluated.  In  other  words,  if  x(t)  is  the 
state  at  any  initial  time  t  <  tf,  one  is  interested  in  evaluating 

x(tf,t)  =  $>(tf,t)x(t)  (G-8) 

Accomplishing  this  for  a  range  of  values  of  t  requires  knowledge  of  4>(tf,t) 
where  the  initial  time  is  regarded  as  an  independent  variable.  Notice  that 
x(tj,t)  is  obtained  from  x(t,t0)  by  making  the  change  of  variables  —  t — tf 
andt0- 1. 


To  obtain  $(tf,t)  from  Eq.  (G-7)  requires  that  a  complete  inte¬ 
gration  of  the  matrix  differential  equation  be  performed  over  the  interval 
(t , tf )  for  each  initial  time  t  of  interest.  Often  this  must  be  done  numeri¬ 
cally  and  a  great  deal  of  computational  labor  is  entailed.  However,  the 
$ 

See  Appendix  A. 
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procedure  can  be  greatly  simplified  if  a  differential  equation  for  $>(tf,  t) 
itself  can  be  determined  with  t  as  the  independent  variable.  To  derive  an 
equation  for  4>(tf,t),  we  begin  with  the  property 

*(tf,tf)  =  *(tf,t)*(t,tf)  =  I 

Differentiate  both  sides  of  this  expression  to  obtain 

*(tf,t)*(t,tf)  +  *(tf>t)*(t,tf)  =  0 

and  therefore,  using  the  fact  that  $>(t,tf)"l  =  4>(tf,t), 

*(tf,t)  =  -*(tf,t)  6(t, tf )  *(tf,t)  (G-9) 

Now  observe  that  the  differential  equation  for  S>(t,t0),  Eq„  (G-7),  can  be 
written  as 


4>(t,tf)  =  A(t)*(t,tf)  (G-10) 

because  Eq.  (G-7)  holds  for  any  t0  —  in  particular  t0  =  tf.  Substitution 
from  Eq.  (G-10)  into  Eq.  (G-9)  produces  the  desired  equation  for  $(tf,t): 

*(tf,t)  =  -$(tf,t)A(t) 

*(tf,tf)  =  I  (G-ll) 

Equation  (G-ll)  is  said  to  be  adjoint  to  Eq.  (G-7)  and  #(tf,t)  is  referred  to 
as  a  sensitivity  function. 

In  the  missile  guidance  application,  only  one  element  of  the 
terminal  state  vector  is  of  interest,  the  miss  distance.  This  can  be  ob¬ 
tained  from  knowledge  of  only  one  row  of  the  transition  matrix,  taken  with¬ 
out  loss  of  generality  to  be  the  first  row.  Denoting  the  first  row  o t  4»(tf,t) 
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by  the  column  vector  g,  it  follows  from  Eq.  (G-ll)  that 


gM)  =  ~A(t) 1  <p(tf,  t);  ^tf,tf)  = 


1 

0 


0 


(G-12) 


To  calculate  $(tf,t)  and  ^(t^t),  Eqs.  (G-ll)  and  (G-12)  are 
integrated  backwards  in  time  from  t  =  tf.  Once  evaluated  these  sensitivity 
functions  determine  the  terminal  state  resulting  from  any  given  initial 
condition  x(t),  according  to  Eq.  (G-8).  The  performance  index  in  Eq. 
(G-3)  is  then  determined  by 


Jf  =  x(t)T  $(tf,t)T  F$(tf,t)x(t) 


(G-13) 


G,3  TERMINAL  STATE  SENSITIVITY  TO  MEASUREMENT  ERRORS 

Another  important  contribution  to  the  terminal  state  can  be 
produced  by  errors  in  implementing  the  linear  control  law  in  Eq.  (G-2). 
These  errors  may  result  from  imperfect  observations  provided  by  sensors; 
they  can  ofter;  be  represented  as  random  fluctuations  or  as  (possibly  time- 
varying)  bias  errors.  To  examine  such  effects,  suppose  that  the  control  law 
for  Eq.  (G-l)  is  given  by 


u(t)  =  K(t)  (x(t)+ €(t))  (G-14) 

where  c(t)  is  an  error  vector.  The  resulting  equations  of  motion  for  the 
linear  system  are 

x(t)  =  A(t)  x(t)  +  B(t)  K(v.)  c(t) 


G-5 


where  A(t)  is  defined  in  Eq„  (G-6).  According  to  linear  system  theory,  * 
the  terminal  state  at  time  tf  caused  by  the  action  of  c(t)  on  the  interval 
(t,tf),  with  zero  initial  conditions  on  the  state  at  time  t,  is  given  by 

X(tf,t)  =  J  *(t,,T)B(T)K(T)i(T)dr  (G-15) 

where  x(tf,t)  is  regarded  as  a  function  of  both  tf  and  t,  just  as  in  Eq.  (G-8). 
A  differential  equation  for  the  terminal  stale  is  obtained  simply  by  dif¬ 
ferentiating  Eq.  (G-15)  to  obtain 

x(tf,t)  =  -  *(tfft)  B(t)  K(t)  c(t) 

x(tf,tf)  =  0  (G-16) 

In  the  special  case  when  the  measurements  contains  only  bias 
errors,  i.e. ,  <e(t)  is  a  constant  £,  the  terminal  state  is  given  by 

x(tf,t)  =  ^(tf,t)e 
*(tf,t)  =  -  #(tf,t)  B(t)  K(t) 

\ 

Mtf,tf)=  0  (0-17) 

If  one  is  interested  in  only  a  single  element  of  x(tj),  then  only  one  row 
(which  can  be  taken  as  the  first)  of  tf(tf,t)  --  denoted  by  ^(tf,t)  --  need  be 
determined;  that  is, 

X^tfjt)  =  4)(tf,t)T€ 

i(tf,t)  -  -K(t)TB(t)%(tf,t) 

0(tf,tf)  =  0  (G-18) 

See  Appendix  A. 
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where  g(tf,t)  is  the  first  row  of  $(tf,t),  defined  by  Eq.  (G-12). 

All  of  the  integrations  in  Eqs.  (G-16),  (G-17)  and  (G-18)  are 
performed  backward  in  time  and  require  knowledge  of  the  transition 
matrix  $(tf,t);  the  latter  is  »3valuated  from  Eqs.  (G-ll)  or  (G-12)  which 
are  also  integrated  backward  in  time.  Therefore  the  sensitivity  functions 
for  initial  conditions  and  measurement  errors  can  be  generated  simul¬ 
taneously. 


Another  form  the  measurement  errors  e(t)  can  take  is  that  of  a 
random  process  (noise)  having  zero  mean  and  second  moments 

E {_c(ti )  e(t2)T}-  =  W(tj)  6(tx  -t2);  for  all  tx  and  t2  (G-19) 

where  W(t^)  is  a  known  positive  definite  matrix  and  6(tj  -t2)  is  the  unit 
Dirac  delta  (impulse)  function.  The  mean  value  of  the  terminal  state  pro¬ 
duced  by  this  noise  is  also  zero.  If  we  denote  the  second  moment  charac¬ 
teristics  of  the  state  by 

P(tf,t)  =  E  |x(tf,  t)  x(tf,t)T| 

where  E  {  }  denotes  mathematical  expectation,  then  it  can  be  shown  that 

P(tf,t)=  -$(tf,t)  B(t)K(t)  W(t)  K(t)T  B(t)T  $(tf,t)T;  P(tf,tf)  =  0  (G-20) 

This  equation  is  also  integrated  backward  in  time.  * 

With  the  terminal  miss  (or  its  statistics)  caused  by  measure¬ 
ment  errors  calculated  from  either  Eq.  (G-17),  (G-18),  or  (G-20),  the 
value  of  the  performance  index  Jf  is  readily  calculated.  For  example,  if 
♦  - 

P(tf,t)  is  called  the  state  covariance  matrix. 
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e_(t)  is  a  constant  bias  error, 

Jf  =  cT  *(tf,t)T  F *(tf,i)  (G-21) 

If  jf(t)  is  a  random  process,  the  expected  value  of  is  given  by 

E{jf}=  E-|x(tf , t)T  Fx(tf,t)|  =  Trace  (P(tf,t)F)  (G-22) 

where  Trace  (PF)  denotes  the  sum  of  the  diagonal  elements  of  the  matrix 
product,  PF. 


G.4  INTEGRAL-TYPE  PERFORMANCE  INDEX  SENSITIVITY 
TO  INITIAL  CONDITIONS 


In  this  section  we  want  to  determine  an  efficient  method  for  cal¬ 
culating  the  values  of  the  performance  indices  in  Eqs.  (G-4)  and  (G-5)  pro 
duced  by  initial  conditions  x(t).  Beginning  with  Eq.  (G-5),  Ju  can  be 
written  explicitly  as  a  function  of  tf  and  t  by  substituting  for  u(t)  from 
Eq.  (G-2)  and  regarding  t0  as  a  variable,  t.  Thus 


Ju(tf,t) 


R(t)  K(t)  x(t)  dr 


(G-23) 


Equation  (G-23)  can  be  expressed  in  terms  of  the  initial  conditions  x(t)  by 
substituting  the  relation  (see  Eq.  (G-8) 


x(t)  =  x(t  ,  t)  =  4>(T,t)x(t) 


to  obtain 


(G-24) 


(  **  ) 

Ju(tf,t)  =  x(t)T  I  J  4>(t, t)T  K(t)T  R(r)  Kvt) #(t, t) dT|  x(t)  (G-25) 
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where  x(t),  being  independent  of  r,  is  factored  outside  the  integration. 

Evidently  the  control  effort  in  Eq.  (G-25)  can  be  expressed  as 

Jjflfrt)  =  x(t)T  C Q(tf , t)  x(t)  (G-26) 


where  C^(t«;,t)  is  a  sensitivity  function  identified  as 


$(r,t)T  K(r)T 


R(t)  K(t)  $(t,  t)  dr 


(G-27) 


A  differential  equation  can  be  obtained  for  this  quantity  by  differentiating 
both  sides  of  Eq.  (G-27)  with  respect  to  t  producing 

tf 

Cu(t{,t)  =  )  ^4(T,t)TK(r)TR(T)K(T)4.(r,t) 

t 

+  4>(r, t)T  K(r)  R(t)  K(t)  4>(r,t)  dr 

-  <J>(t,t)T  K(t)T  R(t)  K(t)  *(t,t)  (G-28) 

Noting  that  $(t,t)  =  I  and  substituting  from  Eqs.  (G-ll)  and  (G-27)  into 
Eq.  (G-28)  yields 

Cu(tf,t)  =  -A(t)TCu(tf,t)-Cu(tf,t)A(t)-K(t)TR(t)K(^  (G-29) 

This  differential  equation  can  be  integrated  backward  in  time  from  the 
terminal  condition 

<ytf,tf)  =  o 
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to  determine  the  solution  for  the  sensitivity  function  in  Eq.  (G-26).  Once 
known,  it  can  be  used  to  evaluate  J  for  any  desired  initial  conditions. 

In  an  exactly  analogous  fashion,  a  sensitivity  function  C  (tf,t) 
can  be  derived  for  evaluating  in  Eq.  (G-4).  The  appropriate  equations 
are  listed  below  without  proof: - 


jx  =  x(tr  cyt^t)  x(t) 


/v  T 

A  /l\  A 


C  M)  =  -  A(t)  C  (tf,t)  -  C  (t^,  t)  A(t)  -  Q(t);  C  (tf,tf)  =  0  (G~30) 


Finally,  in  some  cases  it  is  desirable  to  evaluate  a  composite 
performance  index 

J  =  J.  +  J  +  J 
f  x  u 

in  terms  of  initial  conditions  x(t)  on  the  state.  This  can  be  accomplished 
from  the  relations 

J  =  x(t)T  C(t,,t)  x(t) 

C(tf,t.)  =  -  A(t)TC(tf,t)-C(t{,t)A(t)-Q(t)-K(t)TR(t)K(t) 

C(tf,tf)  =  F  (G-31) 

Using  the  sensitivity  functions  developed  in  this  appendix,  one  can 
evaluate  various  linear  control  laws  for  a  linear  dynamical  system.  The 
importance  of  these  functions  is  that  they  provide  an  efficient  method  of 
evaluating  quadratic  performance  indices  and  determining  the  terminal 
state  at  a  fixed  terminal  time  caused  by  initial  conditions  or  measurement 
errors  which  are  applied  to  the  system  at  a  variable  initial  time. 
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APPENDIX  H 
TRAJECTORY  ^ATA 


In  this  appendix  numerical  values  for  the  parameters  which 
define  equations  of  motion  for  a  missile  airframe  are  presented.  The 
reader  is  referred  to  Section  8. 1  for  definitions  of  terms. 


H.l  AIR-TO-SURFACE  TRAJECTORY  DATA 

One  set  of  data  applies  to  an  air-to- surface  trajectory  for  an 
aerodynamically  controlled  missile  and  is  summarized  in  Tables  H.l, 
H.2,  H.  3,  and  H.4.  The  aerodynamic  coefficients  defined  in  Eq.  (8.1-2) 
are  given  for  several  different  flight  conditions  in  Table  H.  1.  Included 
are  values  of  time  at  which  the  particular  flight  condition  occurs  along  a 
sample  trajectory.  The  corresponding  stability  derivatives  M^,  M^,  etc. 
which  appear  in  Eq.  (8.1-3)  are  listed  in  Table  H.2. 

The  elements  of  A  and  b  in  Eq.  (8.1-4), 


all  a12 

a23 

■  0  ' 

A  = 

a21  a22 

a23 

;  b  = 

bl 

0  0 

a33 . 

.b2. 

are  calculated  from  the  data  in  Tables  H.l  and  H.2;  their  values  are  tabu¬ 
lated  in  Table  H.  3.  When  simulations  with  continuously  varying  param¬ 
eters  are  performed,  values  of  A  and  b  at  times  between  those  listed  in 
Table  H.3  are  obtained  by  linear  interpolation,  i.e. , 
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TABLE  H.  1 

MISSILE  AIRFRAME  DATA  ALONG  AN 


AIR-TO-SURFACE  TRAJECTORY 


Flight  Condition 

1 

2 

3 

n 

’ 

5 

6 

n 

Time  (seconds) 

0-5 

6 

8 

23 

62 

71 

— 

80 

Altitude  (feet) 

35,000 

35,000 

35,000 

0 

0 

0 

Velocity  (ft/sec) 

683 

1459 

2920 

2920 

2791 

2791 

2791 

Dynamic  Pressure  (psf) 

171 

786 

3146 

3146 

9258 

9258 

9258 

Mass  (slugs) 

47.2 

42.6 

28.6 

34.2 

33.8 

33.1 

27.4 

Iyy  (slug-ft2) 

715 

664 

497 

506 

482 

441 

432 

CNa 

7.28 

9.22 

8.14 

8.14 

8.31 

8.31 

8.31 

cn6 

4.53 

4.18 

1.49 

1.49 

1.95 

1.95 

1.95 

cMq 

-86 

-295 

-222 

-222 

-225 

-225 

-225 

CM„ 

-5.82 

-4.24 

-.597 

-.879 

-.515 

-1.27 

-.665 

CM  6 

-17.8 

-18.1 

-7.40 

-7.45 

-9.21 

-9.39 

-9.25 

Constants:  S=  1.23  ft4;  d=  1.25  ft. 


TABLE  H.2 

STABILITY  DERIVATIVES  ALONG  AIR-TO-SURFACE  TRAJECTORY 


Flight 

Condition 


-0.029  -  0.230  -  0.462  -  0.455  -  1.49  -  1.63  -  1.66 

-2.14  -  7.72  -  5.61  -  8.40  -  15.2  -  41.0  -  21.9 

-6.56  -33.0  -72.0  -71.2  -272.0  -303.0  -305.0 

0.0478  0.144  0.379  0.379  1.01  1.03  1.24 

0.0298  0.0652  0.0699  0.0699  0.237  0.242  0.292 


I 


THE  ANALYTIC  SCIENCES  CORPORATION 


TABLE  H. 3 

ENTRIES  IN  A  AND  b  (Eq.  (H-l)) 


Flight 

Condition 

1 

2 

3 

4 

5 

6 

7 

all 

-  0.029 

-  0.230 

0.462 

-  0.455 

1.49 

1.63 

1.66 

a12 

-  0.065 

-  0.0367 

0.00526 

-  0.0054 

0.0054 

0.0142 

0.0063 

a13 

-  5.23 

-  29.5 

-  70.9 

-  69.7 

-  268.0 

-  293.0 

-  300.0 

a21 

32.7 

210.0 

1100.0 

926.0 

2817.0 

2880.0 

3470.0 

a22 

-  0.0478 

-  0.144 

0.378 

-  0.317 

1.0 

1.03 

1.24 

a23 

-1015.0 

-4760.0 

-10200.0 

-8486.0 

-33100.0 

-33900.0 

-40800. 0 

a33 

-  50.0 

-  50.0 

-  50.0 

-  50.0 

-  50.0 

-  50.0 

-  50.0 

b2 

1015.0 

476C.0 

10200.0 

8486.0 

33100.0 

33900.0 

40800.0 

b3 

50.0 

50.0 

50.0 

50.0 

50.0 

50.0 

50.0 

TABLE  H.4 

AIRFRAME  UNDAMPED  NATURAL  FREQUENCY 
AND  DAMPING  RATIO 


Flight  Condition 

1  . 

2 

3 

4 

5 

6 

C 

0.026 

0.067 

0.172 

0.132 

0.305 

0.203 

0.296 

wn 

1.46 

2.78 

2.44 

2.92 

4.09 

6.54 

4.89 

A(t)  =  [A(ti)(ti  +  i-t)+A(ti  +  x)(t-ti) 
b(t)  =  [b(ti)(ti  +  1-t)+b(ti+1)(t-ti) 


L  ,  “t. 

1  +  1  1 


J  *i  + 1  “  *i 


(H-2) 


where  t ^  is  the  time  that  the  i^h  flight  condition  occurs. 
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Additional  important  quantities  are  the  three  open  loop  poles 
of  the  airframe  and  actuator.  One  pole  is  produced  by  the  actuator  and 
has  a  fixed  value 


Pj  =  -X  =  -50.0  sec 

The  other  two,  p2  and  Pg,  are  complex  conjugate  airframe  poles  which  are 
the  roots  of  the  second  order  polynomial 

2  2 
s  +  2Ca>  s  +  oj 
s  n  n 


where 

“n  =  alla22  '  al2a21  W  =  lp3' 

=  “all  “a22  =  "p2  "p3 

The  quantities  u>n  and  C  are  referred  to  as  the  undamped  natural  fre- 
quancy  and  the  damping  ratio  respectively;*  is  the  magnitude  of  p2 
and  C  is  the  cosine  of  the  angle  p2  makes  with  the  negative  real  axis 
in  the  complex  plane.  The  values  of  these  quantities  are  given  in 
Table  H.  4. 

H.2  SURFACE-TO-AIR  TRAJECTORY  DATA 

A  second  set  of  airframe  data  used  for  simulations  described  in 
this  report  is  for  a  surface-to-air  trajectory.  Airframe  dynamics  are 
specified  in  terms  of  the  transfer  function  T(s)  relating  commanded  con¬ 
trol  surface  deflection  to  normal  acceleration, 

* 

See  Ref.  21  for  definitions  of  these  terms. 
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T(s) 


(H-3) 


The  pole  pa  is  associatea  with  the  actuator  and  the  remaining  terms  re¬ 
present  the  airframe  rotational  dynamics.  Values  of  these  quantities 
together  with  the  time,  velocity,  altitude,  and  dynamic  pressure  profiles 
are  given  in  Table  H.  5.  In  order  to  convert  this  data  to  the  form  shown 
in  Table  H.2,  relationships  must  be  found  between  the  quantities  M^, 

Mg,  etc.,  and  the  variables  in  Eq.  (H-3).  This  can  be  done  using 
Eqs.  (8. 1-3),  (8. 1-4)  and  (8. 1-8)  with  the  result, 


L 

a 

Mq 

M 

a 

M« 


*>aV 


Z1  +  z2  +  2C“n 
Z1  +  Z2 


-L  M  -  or 
a  q  n 


—  (m  +  Z1i 

j  \  a  1 
a 


(H-4) 


Substitution  of  these  quantities  into  Eq.  (8. 1-3)  permits  the  calculation  of 
the  elements  defined  in  Eq.  (H-l);  their  values  are  entered  in 
Table  H. 6. 


The  data  supplied  for  these  two  trajectories  has  been  obtained 
by  fitting  sets  of  parameters  for  actual  missile  airframes  to  hypothetical 
trajectories.  Consequently,  although  the  behavior  of  the  airframe  param¬ 
eters  is  qualitatively  representative  of  actual  flight  conditions  and  is 
adequate  to  test  feasibility  of  various  control  methods,  the  specific 
numerical  values  are  not  real  flight  data. 
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TABLE  H.  5 

MISSILE  AIRFRAME  TRANSFER  FUNCTION 
PARAMETERS  FOR  SURFACE-TO-AIR  TRAJECTORY 


Flight  Condition 

1 

2 

3 

4 

5 

6 

k 

6750.0 

27900.0 

39600.0 

48800.0 

51300.0 

41600.0 

Z1 

-  10.9 

-  45.0 

-  68.4 

-  90.9 

-  99.2 

-  90.2 

z2 

9.98 

40.4 

61.7 

82.3 

90.1 

82.5 

“n 

8.50 

17.2 

20.5 

22.9 

23.4 

21.4 

C 

0.0471 

0.095 

0.139 

0.131 

0.138 

0.128 

Pa 

-  50.0 

-  50.0 

-  50.0 

-  50.0 

-  50.0 

50.0 

Velocity  (ft/sec) 

500.0 

2059.0 

3124.0 

4031.0 

4470.0 

4330.0 

Dynamic  Pressure  (psf) 

297.0 

4482.0 

6180.0 

4755.0 

3618.0 

1415.0 

Altitude  (thous.  ft.) 

0 

4.0 

20.0 

40.0 

50.0 

68.4 

time  (secs) 

0 

10.0 

20.0 

30.0 

35.0 

_ 

45.0 

82.7 

76.2 

19.3 

0.121 

50.0 

4220.0 

640.0 

83.6 

55.0 


TABLE  H.  6 

ENTRIES  IN  A  AND  b  (Eq.  (H-l))  FOR 


SURFACE-TO-AIR  TRAJECTORY 


Flight 

Condition 


4.60 

6.70 

8.59 

9.10 

7.70 

6.50 

0.0729 

0.0463 

0.0357 

0.038 

0.0307 

0.0296 

-  256.0 

-  382.0 

-  526.0 

-  544.0 

-  443.0 

-  352.0 

3970.0 

8770.0 

13900.0 

16900.0 

14000.0 

12500.0 

1.93 

2.81 

3.46 

3.79 

3.25 

2.85 

-27900.0 

-39700.0 

-48800.0 

-57350.0 

1 

-41640.0 

-33700.0 

-  50.0 

50.0 

-  50.0 

-  50.0 

-  50.0 

-  50.0 

27900.0 

39700.0 

48800.0 

57350.0 

41640.0 

3  3700  .0 

50.0  j 

i 

50.0 

50.0 

50.0 

50.0 

50.0 
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